Gene Tery_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1604 
Symbol 
ID4242987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2451467 
End bp2452744 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content34% 
IMG OID638106746 
Productextracellular solute-binding protein 
Protein accessionYP_721356 
Protein GI113475295 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCTCA TAGTTAAATG GAAAAGGTTT AAAATATTTA CTATTCTGGG GCTAATTATA 
ACAATTTTTA TTGGTTGCAC TTCTCCCTAT CCTATTTTAA AGACTAAAGA AATTGAATTT
TGGACAATAC AGCTTCAACC TCAGTTTACT AATTATTTTA ATCAGTTAAT TGCTGATTTT
GAAGCAGATA ATCCAGATTT ATCAGTGCGA TGGGTTGATT TACCTTGGTC GGAGATAGAA
AGCAAAATTG AGAGTGCTAT TGTCACAGAA ACTCTTCCAG ACATTGTGAA TCTTAACTCA
AGTTTCACAC ATTTATTAAC TCGACCTAAT GCCTGGCTAA ATTTAGATAC TATTTTATCA
GATTCGGTGA GGAATGAATA TTTACCTAAT CTTTGGCAAT GTAGTCAGAT TAATGGTAAA
AGTCTTGGTA TCCCATGGTA TGGTACGATG AATATAACAA TTTATAATTC AGAGCTATTT
AAAGAGGCAG AACTCGAAGA ACCTCCAACA AATTATACTG AATTAGCTAA GGTCGCTAGA
CAAATTAAAG AAAAAACTGA TAAATATGCA TTCTTTGTTT CTTTTTCGCC ACAAGGAGGA
AATGAGGTAT TAGAGTCTTT TGGCAAGATG GGAGTTGAGT TAATAAATGT TGATGGAAAA
GCTGCTTTTA ATACTCCACT CGGTGAGGCA GTATTTAAGT ATTGGGTTGA TCTATATAGA
GAAGATTTAT TACCAGAGGA AGTATTGACC GAAGGAATTC AAGCAGGTGA AGATTTGTAT
CAAGCTGGAG AAACTGCTAT GGTATTTTCT GGACCAGAGT TTATTACAGC CATATCTGAA
AATATACCTG AAATTGGAAA GGTATCATTC CCAACATCTC AAGTTACAGG TAAGACTGAT
AAGAAGGGTA TAGTTTTGAT GAATTTTGCT ATTTCTAGCA AAACTAAATT ACCAGATGGA
GCCCTGAAGT TTGTTTTGTT TATTACTAAT TATCAAAATC AAATGGCTTT TGCTCAGGAG
ACTAATGTTT TACCCTCTAC AACTAAGACC TTAAAAAATA GTTATTTTCA AAATATTTCT
TCTGATGCTT CTACTCAAGA TCGGGGTCGA GTTATTAGTG CTAATCAAAT TTTGGCAGCA
GAAGTTTTAC TGCCACCTAT TAAAAATTTG GATAAGTTGA AACAAATTAT TTATCAAAGT
TTACAGGAGG CTATGTTGGC AGAAAAAACT ATTGAGCAAG CGATCGCTGA TGCCGCATCT
GAATGGGATA AGCTTTGA
 
Protein sequence
MFLIVKWKRF KIFTILGLII TIFIGCTSPY PILKTKEIEF WTIQLQPQFT NYFNQLIADF 
EADNPDLSVR WVDLPWSEIE SKIESAIVTE TLPDIVNLNS SFTHLLTRPN AWLNLDTILS
DSVRNEYLPN LWQCSQINGK SLGIPWYGTM NITIYNSELF KEAELEEPPT NYTELAKVAR
QIKEKTDKYA FFVSFSPQGG NEVLESFGKM GVELINVDGK AAFNTPLGEA VFKYWVDLYR
EDLLPEEVLT EGIQAGEDLY QAGETAMVFS GPEFITAISE NIPEIGKVSF PTSQVTGKTD
KKGIVLMNFA ISSKTKLPDG ALKFVLFITN YQNQMAFAQE TNVLPSTTKT LKNSYFQNIS
SDASTQDRGR VISANQILAA EVLLPPIKNL DKLKQIIYQS LQEAMLAEKT IEQAIADAAS
EWDKL