Gene Tery_2759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2759 
Symbol 
ID4244792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4271969 
End bp4273429 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content41% 
IMG OID638107818 
Producthypothetical protein 
Protein accessionYP_722415 
Protein GI113476354 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000124104 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.929544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGCG GTCAAAAAAC TATGATGGTA CTATTGAGTC TAGCTCTAAA CTTGGTATTG 
GTAGCACCTC CTGTGAAAGC TGAACCAAAA GTTACCATAA CCCCATCCAA TTTAACCGTA
GTAGGAACTA AATGTCCAGT TTTTTTTTAC TGTCCTGTCC TGAAGCGCCG TCTAGTTTTA
CAAACTAATC AGGCCATCGC AAACTTACAA ATACTCAGCC TAGATTTGGA TCGAACCGAT
GGAACCACCG TTTTACAAGG TAGCGCGATT AATCCAATTT TGTCCACAAC CTCAGTGGAA
CCAAATCAAC CACTGACTAT CCCAGTTGAA TTCAACTTAA ACGATGCCAA AAGTGGTGAA
TTTAGTGGTG TTTTACTAGC CATCCATTCT GATGGACAAC TGGTTATACC TATAATTGTA
CGAGTAAAAG ATCATTGGCT GGGACCAATA TTCCTCTTGC TATTAGGCGT TATGGTAAGT
ATTGGCATGT CTGCCTACCG TACTGATGGC AGAAACCGTG ATGAAATTGT GGTGCTAGTT
AGCCGCATTC GCACTCAGAT GAAAGCTGAC CCAGAACTAG TAAAATCATT TCAGGTAAAA
ATCAATGGAT ATCTAATTGA TGTAGACACC GATTTAACAA ACAAGCGATG GGATGAGGCA
AAGCAAGCTG TAGCAAAAGC TCAAACAACT TGGGATAAAT GGCGAAAAAG TAGAGAAGAC
TGGTTAGCTT TATCCGAATT TGAGTCTTCC TTTCTAGATC TAGATAATCT CACCACTGAT
GCTCCCTATG TACAAACAGT AGGTAGCTAC TTAGAGAATA TTAAGCGACA AACAGCTGAT
AAAGAAAATC CCGAACAGTT GAGGAAAGAG TTAAATGATT TGCGGCAACA ACTGGTTCGT
TACCTGGGAG GTGAAGCTAA ACTAGAGAGA TTCGATAATC TTAGAAATGA GTTAACTGGG
TCAGCACAAC AAGAGCAAGC ACTTAGAGAT ATATCTCAAT ACTTGCAGCA GGAATTAAAT
AATCTTTCTC CAACTGAGCT AGAAGCTTTC CAAAGATGGG AGCAGGAGAT TGATAATGAA
CTCAAACAAC TAGACCAAGC CATTAAACAA CAACAAATTT CCAACAATGA AGCTCAAAGT
AACCTCAGCA TAACTTCCAG GGGTGTTAGT AGCACCTATC CTGCCCGCCT TCCTAACCCA
GTTCCTGATG TTCAACCTCT ACAACTTAAT CCTGTAGGGT CTGCCCGCAA TCTATTTTTG
TTTCAATTAT TAAGTTATAC AATTGCAATT TTCCTCCTGG CGGGTGCAGG CTTTAGACAA
CTTTATGTCA CTCAACCCAC CTTTGGAGCA AATCTTTGGA CAGATTATTT TGCTCTGCTA
GCTTGGGGTT TTGGAGCTGA GGCTACTCGC GATGCCGTTA CCAAAACTAT CCGCGAGTGG
AAATTGCCCG GACTCAAATG A
 
Protein sequence
MMRGQKTMMV LLSLALNLVL VAPPVKAEPK VTITPSNLTV VGTKCPVFFY CPVLKRRLVL 
QTNQAIANLQ ILSLDLDRTD GTTVLQGSAI NPILSTTSVE PNQPLTIPVE FNLNDAKSGE
FSGVLLAIHS DGQLVIPIIV RVKDHWLGPI FLLLLGVMVS IGMSAYRTDG RNRDEIVVLV
SRIRTQMKAD PELVKSFQVK INGYLIDVDT DLTNKRWDEA KQAVAKAQTT WDKWRKSRED
WLALSEFESS FLDLDNLTTD APYVQTVGSY LENIKRQTAD KENPEQLRKE LNDLRQQLVR
YLGGEAKLER FDNLRNELTG SAQQEQALRD ISQYLQQELN NLSPTELEAF QRWEQEIDNE
LKQLDQAIKQ QQISNNEAQS NLSITSRGVS STYPARLPNP VPDVQPLQLN PVGSARNLFL
FQLLSYTIAI FLLAGAGFRQ LYVTQPTFGA NLWTDYFALL AWGFGAEATR DAVTKTIREW
KLPGLK