Gene Tery_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3840 
Symbol 
ID4242291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5931920 
End bp5933398 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content32% 
IMG OID638108772 
Producthypothetical protein 
Protein accessionYP_723355 
Protein GI113477294 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.83902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.693476 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTAA AATTAAAACG ACCAATTTTA GTAGGGGGAA TAGGTTTATC TCTACTGTTA 
TGGCTACTCT CAGAGGTCCA AAATTTTATA ACAGACAATA GTGAACCTAC TATTTTAGGA
ATTATAGTAG TCAGTCTGGG GGTTTGGTTA TTAAAAAGAC CAAAATATCT ACCTTCCCAT
AAACCAAGTA TAATCGTATC ACCTACAAAA CAAGCAGCAG AAGAGGCGAT CGGTTTATTA
TCTATTACTA TAGATAAGGT TTCTATGACA GTAGAAGGTA TTAATAACTC TGAAAAAATA
AGTAATGATA TTACTCAACT ACATCAGCAA GTAAAAATTA TTACTCAGGA ATTAGAAAGG
CAAGAATTAA GTATAGTTAT TACAGGCAAT AAAGGAGTTG GCAAAACTAC TTTTACTGAA
ATATTAAAGT CTCAATGGAA CTCTAAAAAA TCACCAAAAA TAAAAGTTGT TGATATAACT
TGGGAGCCAG AATGGGCAAC AAACAATGAA TATGCTAAAG TTAATCCTCT ATCGCCTTAC
GATTTGATAT TATTTTTGAC TACAGGTGAC TTAATAGATT CAGAATTTCA AGCTTTATCA
AAACTAACAA CTCTTGGTCA ACGTTTTATC TTAATTTGGA ATAAACAAGA CTTATATTTA
CCAGACCAAA AACCACAAGT CATCCAAAAA ATTAAAGAGA CATTATCTAC TATTAACTCA
GAGAAAAATT TAGTAGGAAT TTCTGTAAAA CCTAACCCTA TTAAAGTGCG AAAATATCAG
CAAGATGGCA CTATTCAAGA ATCTATAGAA CAACCATTAC CAGAAATATC TCAATTAACA
GAAAAACTAA ACCAACTATT AGAGGAAGAA AGAGAAAAGT TAGTCTGGGC AACTACTATA
AGAAAAGCGG AAATATTTAG ATTAGAAGCT CAAAATATTT TAAATAAAAT CAGAAAAGAA
CGAGCGCTTC CTGTCATTGA AAAATATCAA TGGATAGCTG CTGCTACAGC ATTTGCTAAT
CCAGTTCCAG CTTTAGACTT ATTAGCAACA GCAGCTATAA ATACTCAATT AGTAGTAGAC
TTAAGTGCTA TATATGAGCA AAAATTTTCT ATAGAAAAAG GTAAACAAGT AGCAGGTACT
ATGGCAGAGT TAATGTTAAA ACTAGGACTA GTAGAACTAT CTACCAAAAC ATTAACTACT
CTACTTAAAA GTAACAGTTT AACTTTTGTT GCTGGTGGTG CATTTCAAGC AGTAAGTGCT
GCTTATTTGA CAAGAGTAGC AGGTATGAGT TTAGTAGAAT ATTTGACTAC TCAAGCAGAT
ACTAATTCCG TGAATATTGA TCAATTAGGA ACAATTATTC AAGGTGTATT TAGTAAAACC
CAAGAAAATA ATTTCTTGAA GTCTTTTGTT ACTCAGGTAA TGAGTCATAT TTTGCCACAG
GGAAAACAAT TAGAATTTGT CTCATCTCCG GCGCAATAA
 
Protein sequence
MAVKLKRPIL VGGIGLSLLL WLLSEVQNFI TDNSEPTILG IIVVSLGVWL LKRPKYLPSH 
KPSIIVSPTK QAAEEAIGLL SITIDKVSMT VEGINNSEKI SNDITQLHQQ VKIITQELER
QELSIVITGN KGVGKTTFTE ILKSQWNSKK SPKIKVVDIT WEPEWATNNE YAKVNPLSPY
DLILFLTTGD LIDSEFQALS KLTTLGQRFI LIWNKQDLYL PDQKPQVIQK IKETLSTINS
EKNLVGISVK PNPIKVRKYQ QDGTIQESIE QPLPEISQLT EKLNQLLEEE REKLVWATTI
RKAEIFRLEA QNILNKIRKE RALPVIEKYQ WIAAATAFAN PVPALDLLAT AAINTQLVVD
LSAIYEQKFS IEKGKQVAGT MAELMLKLGL VELSTKTLTT LLKSNSLTFV AGGAFQAVSA
AYLTRVAGMS LVEYLTTQAD TNSVNIDQLG TIIQGVFSKT QENNFLKSFV TQVMSHILPQ
GKQLEFVSSP AQ