Gene Bcep18194_A5477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A5477 
Symbol 
ID3750698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp2561959 
End bp2563347 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content70% 
IMG OID637763786 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_369715 
Protein GI78066946 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGCC TCATGACCGA CACCCTGTTG TTCGCAGACC ATGCGTACCT GCCCGAAGGC 
TGGCGCCGTA ACGTGCTGCT GCGCTGGGAC GCGGCCGGCA CGCTGACCGG CGTGACGCCC
GACACCGACG CACCCGCTGG CGTCGCGCGC GCGGCCGGGC CCGTGATGCC CGGTATGCCG
AACCTGCATT CGCACGCGTT CCAGCGCGCG ATGGCGGGGC TCACCGAATA CCGCGCGAAT
CCGTCCGACA GCTTCTGGAG CTGGCGCGAC CTGATGTACC GCTTCGCGCT GAAGATCACG
CCCGACGCGC TCGCGGCAAT CGCGCGCTGG CTGTATGTCG AGATGCTCAA GTGCGGCTAC
ACGTCGGTGT GCGAATTCCA CTACGTGCAC CACACGCAGG ACGGCTCGCG CTATCCGCAG
ATCGCCGAGC TCGGCACGCG CGTGATCGAC GCCGCACGCG CGGCCGGCAT CGGCATCACG
ATGCTGCCGG TGTCGTACCA GTTCGCCGGC TTCGGCGACA AGCCGCCGCG CGACGACCAG
CGCCGTTTCA TCAATACGCC CGACGGCCTG CTCGAGCTGC TCGACGCGAT GCGTCGCGTG
GCGCCGGAGC ACGGCGGGCT GCGCTACGGC GTTGCGCCGC ACTCGCTGCG GGCGGTATCC
GAGAACGGGT TGCGCGTGCT GCTCGAAGGG TTGCCCGGCG ATGCGCCCGT GCACATCCAT
ATCGCCGAGC AGACGGCCGA AGTCGACGAC TGCGTGCGTG CCTACGGTGC GCGCCCCGTG
CAATGGCTGC TCGATCGCTT CGACGTCGAT GCGCGCTGGT GCCTGGTGCA CGCGACGCAC
GTCGACGCGG CCGAAACGGC AGCGCTCGCC AAGCGTCGCG CGGTCGCCGG CCTGTGCCTG
ACGACCGAAG CAAATCTCGG CGACGGCGTG TTCCCGGCCG TCGACTATCT CGCGCAGGGT
GGTGTGATCG GTGTCGGCTC GGACAGCCAC GCGTCGGTCG ACTGGCGCTC GGAATTGCGC
CTGCTCGAAT ACGGGCAGCG GCTCGTGCAT CGCGCGCGCA ACGTGCTGGC GAGCGACACG
CAGGCGCACG TCGCCGATCG CCTGTTCGAC GCTTCGCTTG CGGGCGGTGC ACAGGCCAGC
GGGCGGCATG TCGGTGCGCT GCGCGAAGGG TGCCGCGCCG ACTGGCTCGT GCTCGATCCC
GATCATCCGG CGATCGCCGA ACACGACAGC ACGTCGTGGT TGTCGGGTAT CGTGTTCGCG
GAGCACGGCG AGACGCCGGT GCTCGACGTC TACACGGGCG GCGAGCGCGT CGTGAGCGGC
CGCCGTCATC GCGACGAAGC CGTCGCGTAT GCCGACTACC GCGCCGCGCT GGCGCAACTG
CTGCGCTGA
 
Protein sequence
MDSLMTDTLL FADHAYLPEG WRRNVLLRWD AAGTLTGVTP DTDAPAGVAR AAGPVMPGMP 
NLHSHAFQRA MAGLTEYRAN PSDSFWSWRD LMYRFALKIT PDALAAIARW LYVEMLKCGY
TSVCEFHYVH HTQDGSRYPQ IAELGTRVID AARAAGIGIT MLPVSYQFAG FGDKPPRDDQ
RRFINTPDGL LELLDAMRRV APEHGGLRYG VAPHSLRAVS ENGLRVLLEG LPGDAPVHIH
IAEQTAEVDD CVRAYGARPV QWLLDRFDVD ARWCLVHATH VDAAETAALA KRRAVAGLCL
TTEANLGDGV FPAVDYLAQG GVIGVGSDSH ASVDWRSELR LLEYGQRLVH RARNVLASDT
QAHVADRLFD ASLAGGAQAS GRHVGALREG CRADWLVLDP DHPAIAEHDS TSWLSGIVFA
EHGETPVLDV YTGGERVVSG RRHRDEAVAY ADYRAALAQL LR