Gene Bcep18194_A4759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4759 
Symbol 
ID3749967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1755121 
End bp1756233 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content68% 
IMG OID637763056 
Productpeptidyl-arginine deiminase 
Protein accessionYP_368998 
Protein GI78066229 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.675137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.251109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACGA GACGTCAACT GTTGAAACGC GGCCTGTCCG TATCTGGGGC GGCGCTGGCC 
GCCAGCGCGC TGGGCGGGTT GCTCGGCCGC GCCGCGCACG CGCAGCAGGG CGCAACCTGG
CACATGCCGG ACGAGGGTGC GCCGCACACG GCGACGTGGA TGGCGTTCGG CCCGAGCGAG
GACATCTGGG GTGCGCGGCT GTTGCCCGTC GCGCGCGCGA ACCTGGCCGC GATCGCGAAG
GCGATCGCCG CGCACGAGCC GCTCAAGATG CTGGTGCGCG AGCAGGACTA CGCAATCGCG
TCGCGGCTGT GCGGTTCATC GGTCGAGCTC GTCCAGCATC CGGTCGACGA TCTGTGGATG
CGCGACACGG GGCCCGTGTT CGTGAAGAAC GCGTCGGGCC AGCTCGGCGG CGTGAGCTTC
AATTTCAACG GCTGGGGCAA CAAGCAGGAG CACGACCAGG ACGCGGAAGT CGCGCCGTTC
GTGGCGGAGC GCGCCGGTGC ACGGCTGCTC GACACACGGC TGGTGCTCGA AGGCGGCGGC
ATCGAGGTGG ACGGCGAAGG CACGGCGATC ATCACGCGCA GCTGCGTGCT CAATTCGAAC
CGCAATCCGG GCGTCGGCCA GGCACAGTGC GAGGCGGAGC TGAGCCGGCT GCTCGGGCTG
AAGAAGATCA TCTGGCTGCC GGGCATCGCG GGCAAGGACA TCACCGACGG GCATACCGAT
TTCTATGCAC GCTTCACGAG CCCGGGCGTC GTGGTGGCGG GGCTCGATAC CGATCCGTCG
TCGTACGATC ACGCGGTGAC GCGGCAGCAT CTGGAGATCC TGCGGAAATC GACCGATGCG
AAGGGCCGCC CGTTGAAAGT CGTCGTACTG CCGGGCCCGA AGTCCGTCCG GCATCAATAC
GAGAACGAGG AATTCGCGGC AGGTTATATC AACTTCTACG TGTGCAACCG CGCAGTGATC
GCCCCGCAAT TCGGCGACAG CCGCGCCGAC CGCAATACGC GCGACACGCT CGTCGACCTG
TTTCCGGGGC GCGAGGTCAT CCAGCTGAAC ATCGACGGCA TCGCCGCGGG CGGCGGCGGC
ATCCACTGCA CCACGCAGCA GCAGCCGGCC TGA
 
Protein sequence
MTTRRQLLKR GLSVSGAALA ASALGGLLGR AAHAQQGATW HMPDEGAPHT ATWMAFGPSE 
DIWGARLLPV ARANLAAIAK AIAAHEPLKM LVREQDYAIA SRLCGSSVEL VQHPVDDLWM
RDTGPVFVKN ASGQLGGVSF NFNGWGNKQE HDQDAEVAPF VAERAGARLL DTRLVLEGGG
IEVDGEGTAI ITRSCVLNSN RNPGVGQAQC EAELSRLLGL KKIIWLPGIA GKDITDGHTD
FYARFTSPGV VVAGLDTDPS SYDHAVTRQH LEILRKSTDA KGRPLKVVVL PGPKSVRHQY
ENEEFAAGYI NFYVCNRAVI APQFGDSRAD RNTRDTLVDL FPGREVIQLN IDGIAAGGGG
IHCTTQQQPA