Gene Tery_4108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4108 
Symbol 
ID4245622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6336267 
End bp6337910 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content38% 
IMG OID638109009 
ProductCopG-like DNA-binding 
Protein accessionYP_723589 
Protein GI113477528 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.996813 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAAA AACGTGGTAG TGCCACCTCT TATAGAGAAA CCAAGGTTAA AGTAGGTGTG 
TCTCTTACTC CCACTAGCGC TGAGAAGCTA AAAGAAGTAG CAAAAGAACT TGGATTATCA
AGGTCTGAGC TACTTGAGCG TATTGCTAGA GGAGACCTTG CTGTTTTGAG AGAGGCGGCA
CAGATAACAG CGAATTTAAA TAATCAGTTG GTTCCAGTCA ACAATGTGCT GGGTAAAAAT
GAACTCCCTG AAAGTGGTAT AGATGTTGTG ACAATTCCAG AACCATTAAA TGAGTCTGTA
GAAGCACAAT ATTTATTATC AGAAAGTTAT CAAATTTTAC AACAAGAGTT GGAATCTCAG
ATGGCACAGG TTAAGCATTT GGAAAAGCAA ATGGTTAATA TGGTGTCAAA AGAAAGTTAT
GAAACTTTAC AAAAACAATT AGAAGAAGAT AAGAAAGAAA TTACAAGTTC TCGCCAAAAG
TTAGCTCAAC TCAAGCAATT AGAAGAGCAA GTGGCTTTAA TGATACCAAG GGAAAGCTAC
AAAACTCTAG AGCAAGAGTC TCAAGAGCAA AAATTGCAAT TGGAACTTTT GGAAGAGCAA
GTAGGTTCAA TGGTGCCCAA AGAAATTCAT GATTCTTTGC AGTACCAGTC ACAACAGCAA
GAAACTCAAA TTCAACAACT AGAATTACAA GTAGCTTCTA TGGTTTCCCA AGAAGTATAC
GAAATTCTAC AACAAGAGTC TCAAGAACAA AAAGTTAGGT TACAACTGTT AGAAGATCAA
GTAGCTTGTA TGGTGTCTCA AGGAGTTCAT AATTCCTTGC AGGAACAGTC ACAACAGCAA
GAAACTCAAA TTCAACAACT AGAGTCACAA GTAGCTTCTA TGGTGTCCCA AGAAGTATAC
AAAATTCTAC AGCAGGAGTC TCAAGAACAA AAAGCGAAGT TGGAGCTGTT GGAAGAGCAA
GTAGATTCTA TGGTGTCCCC AGAAATTCAC GATTCTTTGC AGCAGCAGTC ACAACAGCAA
GAAACTCAAA TTCAGCAACT AGAATCACGA GTAACTTCTA TGGTGTCCCA AGAAGTATAC
AAAATTCTAC AGCAGGAGTC TGAAGGACAA AGAGTCAAGT TGGAACTGTT GGAAGAGCAA
GTAGCTTGTA TGGTGTCCCA AGAAATTTAT AATTCTTTGC AGCACCGATC ACAGCAGCAA
GAAACTCAAA TTCAGCAACT AGAATCACAA GTAGCTTCTA TGGTTTTACA CGAAAGTTAT
GATGCTTTGC AACGTCACTC AGAAGAACAG GAAAATAAGT TGCAAGACCT AGAAGAACAA
GTGGCCACAA TGGTGACTAG AACAGTCTAT GAGACTTTGC AAAAACAGTT ACAAGAACAA
GTGAATAACT TGCAATACAT GGAATCTATG CTTGGTTCAA TGATATCCCA GCAAACCTAT
GAAGCTTTAA AAAAAGAGTC AGAATTTCAA AAACAAATAA TTGCAGATTA TCAAAATAAG
CTGACTCAGT TACAAGAGAA GTATCATAGC CAATCTGAGC TATTAGAGAC TTCTAAAAAA
GAAGTTTCTC AGTTAAAAAC CCCAGCTATG ATTGGAGAAA ATCGATATCT TAACAAGTGG
CAATATCGGA CTTTTTCCAG ATAA
 
Protein sequence
MTKKRGSATS YRETKVKVGV SLTPTSAEKL KEVAKELGLS RSELLERIAR GDLAVLREAA 
QITANLNNQL VPVNNVLGKN ELPESGIDVV TIPEPLNESV EAQYLLSESY QILQQELESQ
MAQVKHLEKQ MVNMVSKESY ETLQKQLEED KKEITSSRQK LAQLKQLEEQ VALMIPRESY
KTLEQESQEQ KLQLELLEEQ VGSMVPKEIH DSLQYQSQQQ ETQIQQLELQ VASMVSQEVY
EILQQESQEQ KVRLQLLEDQ VACMVSQGVH NSLQEQSQQQ ETQIQQLESQ VASMVSQEVY
KILQQESQEQ KAKLELLEEQ VDSMVSPEIH DSLQQQSQQQ ETQIQQLESR VTSMVSQEVY
KILQQESEGQ RVKLELLEEQ VACMVSQEIY NSLQHRSQQQ ETQIQQLESQ VASMVLHESY
DALQRHSEEQ ENKLQDLEEQ VATMVTRTVY ETLQKQLQEQ VNNLQYMESM LGSMISQQTY
EALKKESEFQ KQIIADYQNK LTQLQEKYHS QSELLETSKK EVSQLKTPAM IGENRYLNKW
QYRTFSR