Gene Ent638_3157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3157 
Symbol 
ID5111710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3437129 
End bp3438331 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content50% 
IMG OID640493356 
Productglycine betaine transporter ATP-binding subunit 
Protein accessionYP_001177872 
Protein GI146312798 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0735792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTA AATTAGAAGT TAAAAATCTG TATAAAGTAT TTGGCGATAA TCCGCAGCGA 
GCCTTCAAAT ATATTGAAAA AGGACTTTCA AAAGAATTAA TCCTGGAGAA AACAGGGCTT
TCGCTTGGCG TTAAAGACGC CAGTCTGGCC ATTGAAGAAG GCGAGATTTT CGTCATCATG
GGATTATCCG GTTCGGGTAA ATCCACCATG GTACGCCTTC TCAATCGCCT GATTGAACCC
ACTCGTGGAC AGGTGCTGAT TGACGGCGTG GATATCGCAA AAATATCCGA TGCTGAGCTG
CGCGAAGTGC GCAGGAAAAA GATTGCAATG GTCTTCCAGT CATTCGCGCT AATGCCACAT
ATGACGGTAT TGGATAATAC CGCCTTCGGT ATGGAATTAG CGGGAATCCC TGCGGCTGAG
CGCCAGGAAA AAGCGCTGGA TGCATTGCGT CAGGTTGGAC TTGAAAATTA CGCTCACGGT
TATCCGGATG AACTCTCGGG CGGTATGCGC CAGCGTGTGG GTTTGGCCCG CGCATTAGCG
ATTAACCCCG ATATCTTATT AATGGATGAA GCCTTCTCGG CGCTCGATCC TTTAATTCGT
ACCGAGATGC AGGATGAACT GGTAAAACTT CAGGCAAAAC ATCAGCGCAC CGTGGTGTTT
ATTTCCCACG ATCTGGATGA AGCCATGCGA ATTGGCGACC GTATTGCCAT TATGCAAAAT
GGCGAAGTGG TTCAGGTCGG CACCCCGGAC GAAATTCTGA ATAATCCGGC AAATGATTAT
GTGCGGACCT TCTTCCGTGG CGTAGATATT AGCCAGGTCT TTAGCGCCAA AGATATTGCC
CGTCGCGCGC TGAACGGCAT TATTCGTCGT ACGCCTGGTT TTGGCCCGCG ATCGGCGTTG
AAGCTGCTAC AGGATGAAGA CCGCGAATAC GGCTATGTGA TTGAACGCGG TAATAAATAT
GTTGGCATTG TCTCCATTGA TTCACTGAAA AGCGCGTTAA GCGAAAATCT GGGAATCGAT
GCGGCGTTAA TTGACGCTCC ACTTGCCGTG GACGCCGAAA CACCGCTCAG CGAGTTGCTC
TCTCATGTGG GTCAGGCGCC GTGCGCCGTA CCGGTTATCG GAGAAGAACA ACAATACGTC
GGCATCATCT CAAAACGGAT GTTGCTGCAG GCTTTAGATC GCGAGGGGGC AAACAATGGC
TGA
 
Protein sequence
MAIKLEVKNL YKVFGDNPQR AFKYIEKGLS KELILEKTGL SLGVKDASLA IEEGEIFVIM 
GLSGSGKSTM VRLLNRLIEP TRGQVLIDGV DIAKISDAEL REVRRKKIAM VFQSFALMPH
MTVLDNTAFG MELAGIPAAE RQEKALDALR QVGLENYAHG YPDELSGGMR QRVGLARALA
INPDILLMDE AFSALDPLIR TEMQDELVKL QAKHQRTVVF ISHDLDEAMR IGDRIAIMQN
GEVVQVGTPD EILNNPANDY VRTFFRGVDI SQVFSAKDIA RRALNGIIRR TPGFGPRSAL
KLLQDEDREY GYVIERGNKY VGIVSIDSLK SALSENLGID AALIDAPLAV DAETPLSELL
SHVGQAPCAV PVIGEEQQYV GIISKRMLLQ ALDREGANNG