Gene Tery_3568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3568 
Symbol 
ID4244317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5484593 
End bp5487043 
Gene Length2451 bp 
Protein Length816 aa 
Translation table11 
GC content41% 
IMG OID638108536 
Producthypothetical protein 
Protein accessionYP_723125 
Protein GI113477064 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4354] Predicted bile acid beta-glucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.592893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAC AAAATCCTAT CCCTAACATT CCAGCCATAT CCTGGAATCG CCCCATTGGT 
CTTGATTGGG ACAAACCTTA CACAGTCCGT TATAGTAGTA ATATTGATGA TGGCCCATGG
CACGGAATGC CTCTGGGAGG TTTCGGTGCA GGTGCCACTG GTCGTTCCCC CCGGGGGGAT
TTTAATCTTT GGCATCTTGA CGGTGGGGAG CACATTTATC AAAATCTGCC CGCCTGTCAA
TTTAGCGTTT TTGAAGAAAC AAAAGGACAG AAACAAGCTT ATGCTTTATC TACCGAATTA
CCCACTGATG GTAGTCTATC TGCTTGGCAG TGGTATCCTC GGGAAAAAGC CGGGCTCAAG
ACTGGCACTT ATCACGCTCT TTACCCCCGC AGCTGGTTTG TTTATGAGAA TGTATTTACT
GCACAATTAA GTTGTGAACA ATTTTCCCCA ATTTTGGCAG GTAACTATCA AGAAACCAGT
TATCCGATCG CTATATTTGA ATGGACTGCC CATAATCCCA CAGATGAAGC AATTACACTT
AGCATAATGC TAAGTTGGCA AAATACTGTT GGTTGGTTTA CCAATTCTGT GAAAACACCA
GAAGTGAGAG TGCGAGATGA TGGTAGCCCA GTTTATGAAT ACAAACCTCG GTGGGGAGAG
AGTACAGACA ACTTTAACTT ATTAGTAGAA GATTTTCACC GTATTGGTTG CACTATGACC
AAATTAAGTA TTGCCGACGA ACCTGCAGAA GGAGAGGGGC AAATGGCGAT CGCGACTTTT
ATCAATGCCG GCATGGAAGT TTTCTATCAT ACAAGGTGGA ACCCTACTGG TACTGGGGAG
GATATCTGGC ATTACTTTGC TCTAGATGGC TCTCTCATAG ATGAAGAAAA CGAACTACCA
GCAACTGAGG GAGAACAAAT TGGTGTGGCT ATATCAGTTC GTTTTACTAT CAGACCAGGG
AAAAACAGGA AAATTCCTTT TTTTTTAGTT TGGGATTTAC CTGTGACTGA CTTTGGTAAT
GGGGTTTCAT ATTACCGCCG TTATACAGAT TTTTATGGTC GCAATGGCAA AAATGCCTGG
TCGATGATTC GTACTGCCAT GAAGCATTAT CAAACTTGGC GGGAAAATAT TGAAGCTTGG
CAAAATCCGA TTTTGCAACG GGAAGATTTG CCTAATTGGT TGAAAATGGC TTTGTTGAAT
GAACTTTATG ACCTCACCAG TGGTGGAACT ATTTGGGCGG CTGCTAGCGA TAGCGCTCCT
TACGGTCAGT TTGCTGTTCT GGAATGTCTT GATTATCGTT GGTATGAGAG TTTGGATGTT
AGGTTGTATG GTTCTTTTGG GTTGTTAATA TTGTGGCCAG AGTTGGAAAA GTCTGTTTTA
GTTGCGTTTG CGCGAGCAGT TTCGACAGCG GATGATACAC TGAGAATTAT TGGTTACAAT
CAAGTGTCGG CAGTGAGAAA GGTTGCTGGA GCTACTCCCC ATGACTTGGG TGCACCGAAT
GAGCATCCCT GGGAAATGAC AAATTATACC AGTTATCAGG ATTGTAATCA ATGGAAGGAT
TTATCTAGTG ATTTTGTATT GCAGGTATAT CGAGATTTTT TGTTGACTGG TGCAGATGAT
TATGAGTTTC TCTGGCAGTC TTGGTCTGCT ATTACGGAAA CTCTAGCATA TCTTAAGGGT
TTTGACAAAG ATAATGATGG CATTCCTGAA AATGAGGGAG CGCCTGACCA AACTTTTGAT
GATTGGCAGT TGCGGGGTGT GAGTGCATAT TGTGGTGGAC TATGGCTGGC TGCACTTGAG
GCAGCGATCG CTATTGGGAA AGTTTTAATT GAGCATCCTA GGGAAATTCC TTATTACCCG
CCTAAGGGTT TCTATTCTGA GGTTGATAAA AATTCTGTAG ATGCAATTAA TAATCAGGTT
TATTTGTATC AAGGTTGGTT AAAAAAAGGA TTGCCTATTT ATCAAGAAAA GTTATGGAAT
GGGGAATATT ATCGTTTGGA TAGTGAGAGT AATTCGGAAG TAGTAATGGC TGATCAATTA
TCGGGTCAAT TTTATGCCAA GCTGTTAAAT TTGGAGGATA TTGTCCCTGC TGAATGTGCT
TTGTCTGCTT TAAAAACTGT CTATAATTCT TGTTTTAAAA ACTTTCATAA TGGCAAGTTT
GGTGCGGCAA ATGGGGTGTT ACCTGATGGT TCTCCAGAGA ATCCTAATGC TACTCATCCT
TTAGAGGTTT GGACAGGAAT TAATTTTGGT TTAGCGGCTT TTATGGTGCA AATTGGTATG
AAAAAAGAAG CTCTGGAAAT AACTGAGGTA GTTGTGGGAC AAATTTATGA AAATGGTTTA
CAATTCCGCA CTCCAGAAGC TATTACAGTA ATGGGTACTT TTAGAGCTAG TCATTATTTA
AGGGCAATGG CAATTTGGGC AATTTATTTA GTCATAGAAG CAAAAAGGTG A
 
Protein sequence
MNQQNPIPNI PAISWNRPIG LDWDKPYTVR YSSNIDDGPW HGMPLGGFGA GATGRSPRGD 
FNLWHLDGGE HIYQNLPACQ FSVFEETKGQ KQAYALSTEL PTDGSLSAWQ WYPREKAGLK
TGTYHALYPR SWFVYENVFT AQLSCEQFSP ILAGNYQETS YPIAIFEWTA HNPTDEAITL
SIMLSWQNTV GWFTNSVKTP EVRVRDDGSP VYEYKPRWGE STDNFNLLVE DFHRIGCTMT
KLSIADEPAE GEGQMAIATF INAGMEVFYH TRWNPTGTGE DIWHYFALDG SLIDEENELP
ATEGEQIGVA ISVRFTIRPG KNRKIPFFLV WDLPVTDFGN GVSYYRRYTD FYGRNGKNAW
SMIRTAMKHY QTWRENIEAW QNPILQREDL PNWLKMALLN ELYDLTSGGT IWAAASDSAP
YGQFAVLECL DYRWYESLDV RLYGSFGLLI LWPELEKSVL VAFARAVSTA DDTLRIIGYN
QVSAVRKVAG ATPHDLGAPN EHPWEMTNYT SYQDCNQWKD LSSDFVLQVY RDFLLTGADD
YEFLWQSWSA ITETLAYLKG FDKDNDGIPE NEGAPDQTFD DWQLRGVSAY CGGLWLAALE
AAIAIGKVLI EHPREIPYYP PKGFYSEVDK NSVDAINNQV YLYQGWLKKG LPIYQEKLWN
GEYYRLDSES NSEVVMADQL SGQFYAKLLN LEDIVPAECA LSALKTVYNS CFKNFHNGKF
GAANGVLPDG SPENPNATHP LEVWTGINFG LAAFMVQIGM KKEALEITEV VVGQIYENGL
QFRTPEAITV MGTFRASHYL RAMAIWAIYL VIEAKR