Gene BURPS1106A_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1987 
SymbolarcA 
ID4903100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1951295 
End bp1952572 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content67% 
IMG OID640135217 
Productarginine deiminase 
Protein accessionYP_001066252 
Protein GI126453511 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.223306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATCGAAG AGGTAATCAA CATGTCCCAA GCCATCCCTC AAGTCGGTGT CCATTCCGAA 
GTCGGCAAGC TGCGCAAGGT GCTCGTCTGC TCGCCGGGCC TCGCGCATCA GCGCCTCACG
CCGAGCAACT GCGACGAGCT GCTGTTCGAC GACGTGATGT GGGTGAACCA GGCGAAGCGC
GATCACTTCG ATTTCGTCTC GAAGATGCGC GAGCGCGGCG TCGAGGTGCT CGAGATGCAC
AACCTGCTGA CCGAGACCGT GCAGAACCCG GCCGCGCTCA AGTGGATCCT CGATCGCAAG
ATCACGCCGG ATAACGTCGG CATCGGCCTC GTCGACGAAG TGCGCGCGTG GCTCGAGGGC
CTCGAGCCGC GTGCGCTCGC CGAGTTCCTG ATCGGCGGGG TCGCGGCGAG CGACATTGCC
GGCGCCGAGC GCTCGAAGGT GCTTACGCTG TTTCGCGACT ATCTCGGCAA GTCGTCGTTC
GTGCTGCCGC CGCTGCCGAA CATGATGTTC ACGCGCGACA CGTCGTGCTG GATCTACGGC
GGCGTCACGC TCAACCCGAT GCACTGGCCC GCGCGCCGGC AGGAGACGCT CCTCGTCGCC
GCCGTCTACA AATTCCACCC GGCGTTCACC GACGCGAAGT TCGACGTCTG GTACGGCGAT
CCCGACCGCG ATCACGGCAT GGCGACGCTC GAAGGCGGCG ACGTGATGCC GATCGGCCGC
GGCGTCGTGC TCGTCGGCAT GGGCGAGCGC ACGTCGCGCC AGGCGGTCGG CCAGCTCGCG
CAGGCGCTGT TCGCCAAGGG CGCGGCCGAG CGCGTGATCG TCGCCGGGCT GCCGAACTCG
CGCGCGTCGA TGCACCTCGA CACCGTGTTC AGCTTCTGCG ACCGCGATCT CGTCACGGTC
TTCCCCGAAG TCGTGAACCG GATCGTGCCG TTCACGCTGC GCCCGGGCGG CGATGCGCGT
TACGGCATCG ACATCGAGCG CGAGGACAAG CCGTTCGTCG ACGTCGTCGC GCAGGCGCTC
GGCCTCAAAT CGCTGCGCGT CGTCGAGACG GGCGGCAACG ATTTCGCGGC CGAACGCGAG
CAATGGGACG ACGGCAACAA CATGGTGTGC ATCGAGCCGG GCGTCGTCGT CGGCTACGAC
CGCAACACGT ACACGAACAC GCTGCTGCGC AAGGCGGGCG TCGAGGTGAT CACGATCGGC
TCGAGCGAGC TCGGCCGCGG CCGAGGCGGC GGCCACTGCA TGACCTGCCC GGTGCTGCGC
GACCCCGTCG ACTACTGA
 
Protein sequence
MIEEVINMSQ AIPQVGVHSE VGKLRKVLVC SPGLAHQRLT PSNCDELLFD DVMWVNQAKR 
DHFDFVSKMR ERGVEVLEMH NLLTETVQNP AALKWILDRK ITPDNVGIGL VDEVRAWLEG
LEPRALAEFL IGGVAASDIA GAERSKVLTL FRDYLGKSSF VLPPLPNMMF TRDTSCWIYG
GVTLNPMHWP ARRQETLLVA AVYKFHPAFT DAKFDVWYGD PDRDHGMATL EGGDVMPIGR
GVVLVGMGER TSRQAVGQLA QALFAKGAAE RVIVAGLPNS RASMHLDTVF SFCDRDLVTV
FPEVVNRIVP FTLRPGGDAR YGIDIEREDK PFVDVVAQAL GLKSLRVVET GGNDFAAERE
QWDDGNNMVC IEPGVVVGYD RNTYTNTLLR KAGVEVITIG SSELGRGRGG GHCMTCPVLR
DPVDY