Gene Tery_4110 details


Gene Information

Locus tag: Tery_4110
Symbol:
ID: 4245624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6340300
End bp: 6342288
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 37%
IMG OID: 638109011
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_723591
Protein GI: 113477530
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
        [COG4176] ABC-type proline/glycine betaine transport system, permease component
TIGRFAM ID:
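
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in exactly one stop codon (both are assumptions, not stated in the record); the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(6340300, 6342288)
print(length)                      # 1989
print(protein_length_aa(length))   # 662
```

Under these assumptions the computed values agree with the Gene Length (1989 bp) and Protein Length (662 aa) listed above.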


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.580114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTATTT TGAATTATTT ATCTTATTAC TTGATCACTG AAACTCAAGG AGGATTAAAC 
AATTTATTGA ACCCTTTCGA GTCTTATACT TTACCCCTGG AGGAATGGGT TAACAATTTA
GTTGATTTTC TGGTTGACAA CTTTCGGTTT ATATTCCAAG CTATCAGCTT ACCTATTAGC
GGAACTCTAA GAATAGTAGA ATTGACTTTT CTAGCAATTC CGCCTTTAAT TTTTCTCATT
ATCACTGGTT TGCTGGTCTG GCAATTAGCA GGTAGAAAAA TAGCTATATA CAGTTTTATT
TCTCTTTGTA TTATTGGTTT TTGTGGTGCT TGGGAACAAT CAATGGTTTC CCTATCTTTA
ATACTTACTG CTGTAATATT CTGTATGCTT ATTGGTATAC CTCTAGGTAT TGCTTGTGCC
CTGAGCGATC GCTTTAATAA AATCTTGCGA CCTCTCCTAG ATGGGATGCA AACTCTTCCA
ACTTTTGTTT ATCTGGTACC TGTAGTAATG CTATTTGGTA TTGGCGAAGT TTCTGGTATA
ATAGCGACTT TTGTTTTTGC AGTTCCTCCT CTAATTCGCC TGACCAACCT GGGCATTAGA
CAGGTGTCCC CAGAAGCGGT AGAGGCTGCA CTTGCGTTTG GTTCAACTCG CCAACAAGTT
TTGTTGGAAG TGCAAATACC TCTAGCTTTA CCTACTATAC TTGCTGGTAC GAACCAAGCC
ATTTTATTAG CCTTATCGAT GTCAGTTGTC ACCTCAATGA TTGGAGTAGA GGGATTAGGA
CAAATGGTAT TACAAGGTTT AGGACGTTTG AATGTGGGAC TAGCGGCAAT AGGGGGCTTA
GGTATTGTAT TAATTGCTAT AATGTTAGAC CGTATTACTC AAGGTGTTAG TCAAGGTAAT
ATTCAGTCTT GGCAAGACCG TGGTCCTATC GGTTGGTGGC GGTCTCGTAA GTTTTTCAAA
CATCATAAAA CAACTTTAGG GATAAGTATT CTGCTTGCCT TGTTAGTTGG TGTGACAGGT
TGGCAATTAA TGTCTCAAAA AAATATCAAG TCAACTGAGC ATCTACTGCC AGGGGAAGGA
GTAGTGGTGC GTCCAACTTC TGGCTTAGAA ACCTACGGTA TATTCACCAC AGAAATTGTG
AATATTGGAC TGGAAAAATT AGGATATGAG ATAAGAGGTG AAAAGCAACT GAATGTTCCA
GCAATGCATC TGGCTGTTAG CAATGGAGAT TTAGATTTTG CTGGAACCCA TTGGGAAGCT
AGCCATCAAG AATTTTTTGA CAATAATGGT GGGGAAGAAA AATTAGAACG GCTAGGAACT
CTGATTTCTA ATTCTACTAT GGGGTATCAG ATAGATAAAA AAACAGCAGA TAAGTACAAT
ATTACTAATA TATCACAACT TAAAGAACCA AAAATTGCTA AACTTTTTGA CTCTGATGGT
GATGGTAAGG CAAATTTAAT TGGTTGTACT GCTGGTTGGT CGTGCGAAAG AGTCATAAAT
CATCATTTAG AGGTTTATGG ACTTGAAGAT ACTGTTGAGC AAATTCAAGG AAATTATTCT
TCTTTGTTGG CAGATATTTT AGTTCGTTAT CGTCAAGGAA AACCAATTTT ATTTTTTGCT
TGGAAACCCC ATTGGTTTTC TTCTATTTTA AGAGAAGGAG AAAATGTAGA ATGGTTGAGT
GTTCCTTTTA CATCTTTGGT AGGAACTATG GAAAACTTCA CGGAAAAAGA TACTTTATTT
AATGATAAAA ATATCGGTTT TCCTATTGAT AATGTCAGGA TATTAGCTAA TAAGAAATTC
TTGAAGGCTA ACCCAGTTGC TAAACGTTTA TTTGAACAGA TTGAGATTCC TTTAGAGGAT
GTTAGTATTG AACAGGAAAA AGTAAAAAAT GGAGAAAATA AACCTATTGA TATTCGCCAT
CATGCTGAGG AATGGATAGT TAATCATCAA GGGTTATTTG ATAGTTGGTT AGAAATAGCA
AAAAGTTAA
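
The gene sequence is printed in blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch in Python; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGTTTATTT TGAATTATTT ATCTTATTAC TTGATCACTG AAACTCAAGG AGGATTAAAC"
seq = join_blocks(first_line)
print(len(seq))                     # 60
print(f"{gc_fraction(seq):.1%}")    # GC content of this 60-base sample only
```

Running `gc_fraction` on the full joined sequence gives a figure that can be compared with the GC content field in the Gene Information section.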
 
Protein sequence
MFILNYLSYY LITETQGGLN NLLNPFESYT LPLEEWVNNL VDFLVDNFRF IFQAISLPIS 
GTLRIVELTF LAIPPLIFLI ITGLLVWQLA GRKIAIYSFI SLCIIGFCGA WEQSMVSLSL
ILTAVIFCML IGIPLGIACA LSDRFNKILR PLLDGMQTLP TFVYLVPVVM LFGIGEVSGI
IATFVFAVPP LIRLTNLGIR QVSPEAVEAA LAFGSTRQQV LLEVQIPLAL PTILAGTNQA
ILLALSMSVV TSMIGVEGLG QMVLQGLGRL NVGLAAIGGL GIVLIAIMLD RITQGVSQGN
IQSWQDRGPI GWWRSRKFFK HHKTTLGISI LLALLVGVTG WQLMSQKNIK STEHLLPGEG
VVVRPTSGLE TYGIFTTEIV NIGLEKLGYE IRGEKQLNVP AMHLAVSNGD LDFAGTHWEA
SHQEFFDNNG GEEKLERLGT LISNSTMGYQ IDKKTADKYN ITNISQLKEP KIAKLFDSDG
DGKANLIGCT AGWSCERVIN HHLEVYGLED TVEQIQGNYS SLLADILVRY RQGKPILFFA
WKPHWFSSIL REGENVEWLS VPFTSLVGTM ENFTEKDTLF NDKNIGFPID NVRILANKKF
LKANPVAKRL FEQIEIPLED VSIEQEKVKN GENKPIDIRH HAEEWIVNHQ GLFDSWLEIA
KS
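
The protein sequence begins with M although the gene sequence begins with GTG, which is consistent with translation table 11 treating GTG as an alternative initiation codon. The following is a minimal sketch of such a translation in Python. It assumes the amino acid assignments of the standard genetic code (which table 11 shares) and a start codon set recalled from the NCBI definition of table 11; `translate_cds` is a hypothetical name, and a maintained library would be the safer choice for real work.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position, third base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(cds: str) -> str:
    """Translate a CDS, reading an initiation codon as M and stopping at the first stop."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last three codons of the gene sequence in this record.
print(translate_cds("GTGTTTATTTTGAATTAT"))  # MFILNY
print(translate_cds("AAAAGTTAA"))           # KS
```

Both fragments match the corresponding ends of the protein sequence listed above (MFILNY at the start, KS at the end).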