Gene BamMC406_5776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBamMC406_5776 
Symbol 
ID6182521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia ambifaria MC40-6 
KingdomBacteria 
Replicon accessionNC_010557 
Strand
Start bp303461 
End bp304669 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID641688912 
Productectoine utilization protein EutD 
Protein accessionYP_001815771 
Protein GI172065059 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID[TIGR02993] ectoine utilization protein EutD 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.230868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAG TCATCGAAAC CCACCAAGCC GTGCCACGCC TCGCGTTCGA GCGCAGCGAA 
TACGCCGCGC GCATCGCGAA GACGCGCACG GCGATGCAGC GGGCCGGCAT CGACCTGTTG
ATCGTCACCG ACCCGACCAA CATGGGCTGG CTCACCGGCT ATGACGGCTG GTCGTTCTAC
GTACACCAGT GCGTGCTGCT GCCGATGGAC GGCGAGCCCG TCTGGTACGG CCGCGGCCAG
GACGCGAACG GCGCGAAGCG CACCGTGTTC ATGGCGCACG AGAACATCGT CGGCTACCCG
GATCACTATG TGCAGTCGAC GGCCCGTCAC CCGATGGACT ACCTGTCGAC CGACGTGATT
GCCGCACGCG GCTGGAGCAC GCTGCGCATC GGCGTCGAGC TCGACAACTA TTACTTCAGC
GCGGCGGCGT ACGCGTCGCT GCAGAAGCAT CTGCCGGCCG CGCGCTGGGT CGACGCGACC
GCGCTCGTGA ACTGGCAGCG CGCGGTGAAG TCGCCGCGCG AGATCGAGTA CATGCGCGTT
GCCGCACGGA TCGTCGAGCG CATGCATGCG CACATCGTCG ACACGATCGA GCCCGGCATG
AAGAAGAGCG ATCTCGTCGC GCAGATCTAT GCGACCGGGA TCGGCGGCGC GGACGGCTTC
GGCGGCGACT ATCCGGCGAT CGTCCCGCTG CTGCCGACCG GCGCCGATGC GGCCGCGCCG
CACCTGACGT GGGACGACAC GACGTTCGCG CGCGGCGCGG GCACGTTCTT CGAGATCGCG
GGCTGCTACC GCCGCTATCA CTGCCCGCTG TCGCGCACCG TCTATCTCGG CAAGCCGCCC
GCGCACTTCA TCGAAGGCGA GCGCGCGGTG GTCGAAGGGA TCGAAGCCGG GCTCGCGGCC
GCGAAGCCCG GCAACGTGTG CGAGGACATC GCGAACGCGT TCTTCGCGGT GCTGCGCCGC
GCGGGCATCG AGAAGGACAG CCGCTGTGGC TACCCGATCG GCGCGAGCTA TCCGCCGGAC
TGGGGCGAGC GCACGATGAG CCTGCGCCCG GGCGACCGCA CGGTGCTCGA ACCCGGCATG
ACGTTCCATT TCATGCCGGG GCTGTGGCTC GACGACTGGG GTCTGGAGAT CACTGAAAGC
ATCCTGATCA CCGACACCGG CGTCGAGACG TTCTGCAACA CGCCGCGCAA GCTGTTCGTG
AAGGAGTAG
 
Protein sequence
MSAVIETHQA VPRLAFERSE YAARIAKTRT AMQRAGIDLL IVTDPTNMGW LTGYDGWSFY 
VHQCVLLPMD GEPVWYGRGQ DANGAKRTVF MAHENIVGYP DHYVQSTARH PMDYLSTDVI
AARGWSTLRI GVELDNYYFS AAAYASLQKH LPAARWVDAT ALVNWQRAVK SPREIEYMRV
AARIVERMHA HIVDTIEPGM KKSDLVAQIY ATGIGGADGF GGDYPAIVPL LPTGADAAAP
HLTWDDTTFA RGAGTFFEIA GCYRRYHCPL SRTVYLGKPP AHFIEGERAV VEGIEAGLAA
AKPGNVCEDI ANAFFAVLRR AGIEKDSRCG YPIGASYPPD WGERTMSLRP GDRTVLEPGM
TFHFMPGLWL DDWGLEITES ILITDTGVET FCNTPRKLFV KE