Gene Tery_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1451 
Symbol 
ID4245765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2191817 
End bp2193394 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content37% 
IMG OID638106604 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_721214 
Protein GI113475153 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins
[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAATC TTACCCTTAG TCCTTGTCAA TCTTTATCTT TCACTCAAGG CTCACTTAAG 
GTTAATAAAT ATTCAATGGA TCTCTCCTAT TCAAATTTAA CAGGAGCAAA TCTCTCAGGT
GCTAATCTTG CCGGAATTAA TTTACAGGGG AGTAATCTTC AAGGTGCAAA CTTGGTAAAT
GCTAATTTAG AAGGTGCGAA TTTAAAGGAC GTCAATTTAG AAGGTGCTAA TTTAGCACGC
GCTAACTTAA AGAAAGCCAT ACTTCAAAAT AGTAATTTAG ATAATAGTAA TTTGTATGGA
TCTGACCTTC AAGCAGCTGA TTTCTCTGAA GCTAATCTCG TCAATATGAA AGCTTTATGG
GCTAATTTTC ATAATGCTAT TTTCCATCGG GCAAACTTAG AATCAGCGAA TTTTAATCGA
GCAAATTTAA GAGGGGCTGA TTTTTATAAA GCTAATCTAG AGAATGCCTC CTTGCGTTTT
ACCGATTTTG GTAGCACGAC TAACGTGATC GAAGCTAAAT TAAACCCCAC TAATTTCCGA
GAAACTCAGT TAAAAGGTGC TGATCTGTGG GGAGCAAAAA TGTGGTCAAT ATTTCAAATT
AAACAGGCTA AAAATTGGCA GGAAACCAAT AGGATGCCTA ATTGGGAACA GCAAATCAAA
CAAGCACGCT TACCCCGTTT AAGGATAGCT TTGCTCAAAC CAGAAAATGC TGACAGTATT
TCTGATACCT ATGAATTTGG GATGCGTCGT GCTGCTAACC GTCGTGTGGA AATTTGGGGT
ATTTCTTATC CCGGTGGTGT GAAGAACGAA GCGAAAATAA TTAGGCAGTT GATTAAGGAT
GGCATGGATG GGATTATCTT AACGCCTGAA GATCCTGTTC AGTCCCTTGA TGCCCTGAAA
CTGGCAAGGG ATGCTGGAGT AGCTATTACC ACTGTTGATT TCTGCTTTAA TCCTATAGAT
GCAGAAGATT TAGCTATTGC TTGCTATAAT ACAAATAGTT TTCAAATGGG CTATGATTCA
GGCCAATATA TAGCAGAATG GGCTCAAAAG AATTTACAGT CAAAATCGGT TCAAATTGGT
TTAGTTGATG GTGCTGTATA TGATCGCTAC TATCCCTATC TTCAAGGAGT CTTAAAAGCG
ATCAACCATT CGGGTATCCC TTTTCAAATT GTTGACTCTG TTAGCATTGC TTTTGGTAGT
GATATTATAA AAGTTAAAAA ACTACTTGAA GATAATCCTG ATGTTCAGAT ACTTTGGGGG
GGATCAAATA TAGCAACGGA GGTTACAGTT GCAGCAGTAG CAGAATCTGC CTTGAAGAAT
AAAGTTAAGG TTTTTGGAAT TTTAGACCTC TCAAGAAATA AAGCTACAAA ATTACTTAAT
CCCAATAGTC CTTTACAGTT GATTATTGAG CAGTCTAGTA TTCAGATTGG TTATGAGGCT
GTGAAAACAA CAATTTCTGT CTTGAGAAAA GAAATAGACG GAGCAGATTA TCAGGTTTAC
CCCGTTAAGC ACCGTCTATT AACTCAAAAT GACCCAGACA TTGTAAGTGA TATACTCAAC
GACTCAAGTT TAGAATAA
 
Protein sequence
MANLTLSPCQ SLSFTQGSLK VNKYSMDLSY SNLTGANLSG ANLAGINLQG SNLQGANLVN 
ANLEGANLKD VNLEGANLAR ANLKKAILQN SNLDNSNLYG SDLQAADFSE ANLVNMKALW
ANFHNAIFHR ANLESANFNR ANLRGADFYK ANLENASLRF TDFGSTTNVI EAKLNPTNFR
ETQLKGADLW GAKMWSIFQI KQAKNWQETN RMPNWEQQIK QARLPRLRIA LLKPENADSI
SDTYEFGMRR AANRRVEIWG ISYPGGVKNE AKIIRQLIKD GMDGIILTPE DPVQSLDALK
LARDAGVAIT TVDFCFNPID AEDLAIACYN TNSFQMGYDS GQYIAEWAQK NLQSKSVQIG
LVDGAVYDRY YPYLQGVLKA INHSGIPFQI VDSVSIAFGS DIIKVKKLLE DNPDVQILWG
GSNIATEVTV AAVAESALKN KVKVFGILDL SRNKATKLLN PNSPLQLIIE QSSIQIGYEA
VKTTISVLRK EIDGADYQVY PVKHRLLTQN DPDIVSDILN DSSLE