Gene Tgr7_0947 details

Gene Information

Locus tag: Tgr7_0947
Symbol:
ID: 7314978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. HL-EbGR7
Kingdom: Bacteria
Replicon accession: NC_011901
Strand:
Start bp: 1021885
End bp: 1023753
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 67%
IMG OID: 643615832
Product: extracellular solute-binding protein family 5
Protein accession: YP_002513022
Protein GI: 220934123
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
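
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1021885
end_bp = 1023753

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1869
print(protein_length)  # 622
```

Both values agree with the Gene Length and Protein Length fields listed in the record.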


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACAC GTTTTGCCGC CAAAGACCTG ACCCTGATCC TTGGCCTGGT CATCCTGGGC 
GTGATGCTGA TCCTGCTCAT GTACCAGGTG GACCGCCAGT GGAGCCGCAT GAGCGAGATG
CAGCGGGCCA TGCAGGAGCA GTCCGACGAT CTGCGTCGGC TGCGCGTGGC CCTGCAGTCC
CTGGAGACCC GGGTGCGCAG CGGCGTGGCC GTGGGTGCAG GGGAGGGGGA TCAGCTGGCC
GATGTCCCGC CCGCCTTCCG GCGCGCGCAA CAGGCCGCCG CCCAGCCCGA CTTCGCCCCC
GGCGACTGGC TGGTGCAGGC CTTCCCCACG GGGCTGAGCA CCATCACCCC TCTGGTGTCC
GCCGACGCCT ATTCCCGGGA GGTGCAGAAC TACGTGCTGG AGTCCCTGAT CACCCGGGAT
CCGGACACCC TGGAATGGCA GGGCCTGATC GCCCGGGACT GGACCATCAG CGAGGACGGA
CTCACCATCA CCTTCCGCAT GCGCCCCGAC GTGAACTTCT CCGACGGCGT GCCGCTCACC
GCCCACGACG TGGTGTTCAC CTGGGCCTTC ATCATGAACG AGGCCATCGC CGCGCCCCGT
TCCCGGGCCT ACCTGGAGAA GCTGGAGAAG GTGGAGGCCC TGGATGATCA CACCGTGGCC
TTCACCTTTG CCGAGCCCTA CTTCAACAGC CTGTCCCTGG CCGGCGGCCT GGAGATCATG
CCGAGGCATT TCTACGCGCG TTTTCTGGAC GATCCCGAGG CCTTCAACCG TTCCCGCGGC
CTGCTGCTGG GTTCCGGCCC CTACCGGCTG TCCGACCCCG AGGGCTGGAC TCCGGACCAG
GGCCTGGTGG AGCTGGTGCG CAATCCGCGC TACTGGGGGC CGGTGGATCC GCCCTTCAAC
CGGGTGCTCT GGCGTGTCAT CGAGAACGAC AGCGCCCGCC TGACCACCTT CCGCAACGGC
GAGATCGACG CCTACGGCGC ACGTCCCCGT GAGTACCAGC GCCTGCTGGA GGACGAACAG
TTGCGCAGCC GCACCCGGCA CTTCGAGTAC ATGAGCCCCA CCGCCGGCTA CAACTACATC
GGCTGGAACC AGGAGCGGGA CGGGCGTGCA ACCCGCTTCG CCGATCCTCG GGTGCGCCAG
GCCATGAGCC ACCTCACGCC GATCGAGCGC ATCATCGACG AGATCATGCT GGGCTATGCG
GAGCCGGCGG TCAGCCCCTT CAACCCCCGC AGCCCGCAGC ACGACACGAG CCTCGAACCC
TATGCCTTCG ACATCGAGCG GGCCACGCAG CTGCTGCACG AGGCGGGCTA CCGCAGCCGC
AACCGTGACG GCATCCTGGT GGACGAACAG GGCAGGCCAT TCGAGTTCGA GCTGGTGTTC
TTCCAGGACA ACGAGGACAC CCGGCGCATC GTGCTGTTCC TGCGCGACAT CTACGCCCGG
GCCGGTATCC TGCTGCGTCC GCGGCCCACG GAGTGGTCGG TGATGCTGGA CCTGCTCACC
CGCAAGGACT TCGACGCCAT CACCCTGGGC TGGACCAGCG GCGTGGAGAC CGACATCTAC
CAGATGTTCC ACTCCAGCCA GACCGTGGCC GGCGGCGACA ACTTCATCAA CTACCGCAAC
CCGGAACTGG ACCGGCTCAT CGACCAGGCC CGGGCCGAGG TGGACGAGGC CGCGCGCATG
GCACTCTGGC AGCAGGTCGA GCGCATCCTG TACGAGGATC AGCCCTACAC CTTCCTCATG
CGCCGCCAGA CGCTCGCCTT CATCGACCAG CGCCTGCACA ACCTGCAGAT CACCAATCTG
GGCCTGAATC TCGGTGCGGT GCCCGTGGAG ACCTATGTGC CAGCGGATAT GCAGAGGTAC
ACGAGATGA
 
Protein sequence
METRFAAKDL TLILGLVILG VMLILLMYQV DRQWSRMSEM QRAMQEQSDD LRRLRVALQS 
LETRVRSGVA VGAGEGDQLA DVPPAFRRAQ QAAAQPDFAP GDWLVQAFPT GLSTITPLVS
ADAYSREVQN YVLESLITRD PDTLEWQGLI ARDWTISEDG LTITFRMRPD VNFSDGVPLT
AHDVVFTWAF IMNEAIAAPR SRAYLEKLEK VEALDDHTVA FTFAEPYFNS LSLAGGLEIM
PRHFYARFLD DPEAFNRSRG LLLGSGPYRL SDPEGWTPDQ GLVELVRNPR YWGPVDPPFN
RVLWRVIEND SARLTTFRNG EIDAYGARPR EYQRLLEDEQ LRSRTRHFEY MSPTAGYNYI
GWNQERDGRA TRFADPRVRQ AMSHLTPIER IIDEIMLGYA EPAVSPFNPR SPQHDTSLEP
YAFDIERATQ LLHEAGYRSR NRDGILVDEQ GRPFEFELVF FQDNEDTRRI VLFLRDIYAR
AGILLRPRPT EWSVMLDLLT RKDFDAITLG WTSGVETDIY QMFHSSQTVA GGDNFINYRN
PELDRLIDQA RAEVDEAARM ALWQQVERIL YEDQPYTFLM RRQTLAFIDQ RLHNLQITNL
GLNLGAVPVE TYVPADMQRY TR
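
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a hedged example, not the database's own pipeline: the record lists translation table 11, which assigns the same amino acids to sense codons as the standard genetic code, so the sketch hard-codes that mapping and does not handle alternative start codons or ambiguous bases. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Only the first three 10-base blocks and the final block of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order. Table 11 shares these
# assignments with the standard code; it differs in permitted start codons,
# which this sketch does not model.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(seq.count(b) for b in "GC") / len(seq)


first_blocks = "ATGGAAACAC GTTTTGCCGC CAAAGACCTG"  # start of the gene sequence
last_block = "ACGAGATGA"                           # end of the gene sequence

print(translate(first_blocks))  # METRFAAKDL
print(translate(last_block))    # TR
```

The two printed fragments match the first ten and the last two residues of the protein sequence above. Passing the complete gene sequence to `gc_percent` would allow the GC content field to be checked in the same manner.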