Gene Tery_2006 details


Gene Information

Locus tag: Tery_2006
Symbol:
ID: 4243433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3124502
End bp: 3126070
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 36%
IMG OID: 638107120
Product: hypothetical protein
Protein accession: YP_721727
Protein GI: 113475666
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID:
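
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start and end coordinates copied from this record.
length_bp = gene_length(3124502, 3126070)
print(length_bp, protein_length(length_bp))
```

With the values from this record the sketch gives 1569 bp and 522 aa, matching the Gene Length and Protein Length fields.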


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.6079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTAATT TAAAAAAGTT GGAAGATTCT ATTGTTTTAA TTGCAAGTGC TAAAGATAAT 
AGAAAAAATG TTATTGGTAC TGGATTTATA TTTCATAAGG AACAGAATTG TACTTATCTA
CTTACTTGTG CTCATGTTGT TGAGGATGTT GGTGGGGCAG ACAACATAAA GGTAAATAAT
ATTCCGGCGG AAGTTATAAA GATAGGAGAT ATTCAGGGTT TTGATTTAGC TGTTTTAAAG
GTGAATGAAA GTTTTTCAGC TCCATCTTTA AGTTTAATGA TTTTATATGG AGAGGAAGAA
AAAAATTTGC TTGTGAAAAT TCCTGGTTAT TATCTTTGGG GTCAAAATAA TGCACTTTGT
CGTCAAACAA TAAAAGGAAG AATGACAGTA GAGGTTGATG GAGAAAGGGC ATTTCAATTA
ATAGAAAATA TGCCAGAAGA TGTGGCGGTT GAAAAGTTAG AGATAGAAAA AGGAAGTCTC
CGTTCAGGTT ATAGTGGTTC TCCTGTTATT GATATTAATA CAGGACTAGT TATCGGTCTA
GTTACTCATA AAATTGATGT TGATGGAGTA GGAATGTTTG GTAGAGCAAT ATCAATAGAG
GCTTTAGAAA AAATTTGGTT TGAAATAACT GATGAAGTCT TTAAAAAAAT TAAACGAGAA
TCAAAAACAA TAGAAGTTTT AACTAGTACA AATATTGAAG ATAATTTAGA AAGTAAAGTT
ACATTATCAA AAGAACTAGA CAAGGGAGAA TTGTTTACTT TTGCAGTTGT AAGTGTTAAT
AATTTTGGCA GTATTGTTAA TCGTAGTCAA GGGGGTGCTA GACAGAAAAT AGAAAATTTA
GGTAATGGAA TTAAATTGGA AATGGTTTAT ATTCCTGGGG GTACTTTTAC TATGGGTTCT
CCTGAAAGTG AAGTGGATAG CAATAATAAT GAACGGCCCC AACATGATGT AACTGTCCCT
AACTTTTTTA TGGGCAAATA TCCAGTTACT CAAGGACAGT GGAAAGCGAT CGCCTCCTGC
GCAGACTTGA AAGTAAAATT AGACCTAGAG CTAGAACCAT CTTATTTTAA AGAACCATAC
CGAAATATAG ATAGATGGCA GAGACCAGTT GAGGAGGTTA TTTGGTACCA AGCTATAGAG
TTTTGCCAAA GGCTATCGAA ATTAACAGGA AAGAATTATA GACTGCCCAG TGAAGCAGAA
TGGGAATATG CTTGCCGTGC AGGAACAACT ACACCTTTCT ATTTTGGGGA AACTATAACG
CCTGAGTTAG TTAACTATAA TGACAAATAT GTTTATGGTA GTGCACCAAA AGGAGAATAT
AGAGAACAAA CAACTCCCGT AGGCCAATTT CCGGCTAATG CTTTTGGGTT ATACGATATG
CACGGAAATG TGTGGGAGTG GTGTGCTGAT CAATGGCATC GTAACTATAA TGGTGCTCCT
ACAGATGGCA GTGTTTGGCT AGATGGAGAT AAAGAGATAA CATGTGTGCG GGGCGGTTCC
TGGGACGACT TTCCTAATTC TTGCCGTTCT GCGTTTCGCT TGAACTATGT TAGGCGCGAC
TACCGCTAG
 
Protein sequence
MVNLKKLEDS IVLIASAKDN RKNVIGTGFI FHKEQNCTYL LTCAHVVEDV GGADNIKVNN 
IPAEVIKIGD IQGFDLAVLK VNESFSAPSL SLMILYGEEE KNLLVKIPGY YLWGQNNALC
RQTIKGRMTV EVDGERAFQL IENMPEDVAV EKLEIEKGSL RSGYSGSPVI DINTGLVIGL
VTHKIDVDGV GMFGRAISIE ALEKIWFEIT DEVFKKIKRE SKTIEVLTST NIEDNLESKV
TLSKELDKGE LFTFAVVSVN NFGSIVNRSQ GGARQKIENL GNGIKLEMVY IPGGTFTMGS
PESEVDSNNN ERPQHDVTVP NFFMGKYPVT QGQWKAIASC ADLKVKLDLE LEPSYFKEPY
RNIDRWQRPV EEVIWYQAIE FCQRLSKLTG KNYRLPSEAE WEYACRAGTT TPFYFGETIT
PELVNYNDKY VYGSAPKGEY REQTTPVGQF PANAFGLYDM HGNVWEWCAD QWHRNYNGAP
TDGSVWLDGD KEITCVRGGS WDDFPNSCRS AFRLNYVRRD YR
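
The protein sequence can be related back to the gene sequence through translation table 11, the code named in the Translation table field. Table 11 assigns the same amino acids as the standard code but also accepts alternative start codons such as GTG, which is read as methionine at the first position. The following is a minimal sketch in Python, assuming the gene sequence is written 5' to 3' on the coding strand. The `translate` helper is hypothetical, and only the first 30 and last 9 bases of the gene sequence are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, in TCAG codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 allows as translation starts.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "GTGGTTAATT TAAAAAAGTT GGAAGATTCT"  # start of the gene sequence
last_9 = "TACCGCTAG"                           # end of the gene sequence
print(translate(first_30))
print(translate(last_9, is_cds_start=False))
```

The two calls print MVNLKKLEDS and YR, which agree with the first ten and last two residues of the protein sequence listed above.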