Gene EcSMS35_0432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0432 
SymbolbrnQ 
ID6144246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp441866 
End bp443185 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content56% 
IMG OID641615328 
Productbranched-chain amino acid transport system II carrier protein 
Protein accessionYP_001742535 
Protein GI170680091 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1114] Branched-chain amino acid permeases 
TIGRFAM ID[TIGR00796] branched-chain amino acid uptake carrier 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.922576 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATC AATTAAGATC GCGCGATATC ATCGCTCTGG GCTTTATGAC ATTTGCGTTG 
TTCGTCGGCG CAGGTAACAT TATTTTCCCT CCAATGGTCG GCTTGCAGGC AGGCGAACAC
GTCTGGACTG CGGCATTCGG TTTCCTCATT ACTGCCGTTG GCCTACCGGT GTTAACGGTG
GTGGCGCTGG CGAAAGTTGG CGGCGGTGTT GACAGCCTCA GCACGCCAAT CGGTAAAGTC
GCTGGCGTAC TGCTGGCAAC GGTGTGTTAC CTGGCGGTGG GGCCGCTTTT CGCTACGCCG
CGTACAGCTA CCGTTTCTTT TGAAGTGGGC ATTGCGCCGC TGACGGGTGA TTCCGCGTTG
CCGCTGTTTA TCTACAGCCT GGTTTATTTC GCTATCGTTA TTCTGGTTTC GCTCTATCCG
GGCAAGCTGC TGGATACCGT GGGCAACTTC CTTGCGCCGC TGAAAATTAT CGCGCTGGTC
ATCCTGTCTG TTGCCGCAAT TATCTGGCCG GCGGGTTCTA TCAGCACGGC GACTGAGGCT
TATCAAAACG CTGCGTTTTC TAACGGCTTC GTCAACGGCT ATCTGACCAT GGATACGCTG
GGCGCAATGG TGTTTGGTAT CGTTATTGTT AACGCGGCGC GTTCTCGTGG CGTTACCGAA
GCGCGTCTGC TGACCCGTTA TACCGTCTGG GCTGGCCTGA TGGCGGGTGT TGGTCTGACT
CTGCTGTATC TGGCGCTGTT CCGTCTGGGT TCAGACAGCG CGTCGCTGGT CGATCAGTCT
GCAAACGGTG CGGCGATCCT GCATGCTTAC GTTCAGCATA CCTTTGGCGG CGGCGGTAGC
TTCCTGCTGG CGGCGTTAAT CTTCATCGCC TGCCTGGTCA CGGCGGTTGG CCTGACCTGT
GCTTGTGCAG AATTCTTCGC CCAGTACGTA CCGCTCTCTT ATCGTACGCT GGTGTTTATC
CTCGGCGGCT TCTCGATGGT GGTGTCTAAC CTCGGCTTGA GCCAGCTGAT TCAGATCTCT
GTACCGGTGT TGACCGCCAT TTATCCGCCG TGTATCGCAC TGGTTGTATT AAGTTTTACA
CGCTCATGGT GGCATAATTC GTCCCGCGTG ATTGCTCCGC CGATGTTTAT CAGCCTGCTT
TTTGGTATTC TCGACGGGAT CAAGGCATCT GCATTCAGCG ATATCTTACC GTCCTGGGCG
CAGCGTTTAC CGCTGGCCGA ACAAGGTCTG GCGTGGTTAA TGCCAACAGT GGTGATGGTG
GTTCTGGCCA TTATCTGGGA TCGTGCGGCA GGTCGTCAGG TGACCTCCAG CGCTCACTAA
 
Protein sequence
MTHQLRSRDI IALGFMTFAL FVGAGNIIFP PMVGLQAGEH VWTAAFGFLI TAVGLPVLTV 
VALAKVGGGV DSLSTPIGKV AGVLLATVCY LAVGPLFATP RTATVSFEVG IAPLTGDSAL
PLFIYSLVYF AIVILVSLYP GKLLDTVGNF LAPLKIIALV ILSVAAIIWP AGSISTATEA
YQNAAFSNGF VNGYLTMDTL GAMVFGIVIV NAARSRGVTE ARLLTRYTVW AGLMAGVGLT
LLYLALFRLG SDSASLVDQS ANGAAILHAY VQHTFGGGGS FLLAALIFIA CLVTAVGLTC
ACAEFFAQYV PLSYRTLVFI LGGFSMVVSN LGLSQLIQIS VPVLTAIYPP CIALVVLSFT
RSWWHNSSRV IAPPMFISLL FGILDGIKAS AFSDILPSWA QRLPLAEQGL AWLMPTVVMV
VLAIIWDRAA GRQVTSSAH