Gene BURPS1710b_2128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2128 
SymbolarcA 
ID3689684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2328496 
End bp2329752 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content68% 
IMG OID637728584 
Productarginine deiminase 
Protein accessionYP_333523 
Protein GI76808727 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.853233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCAAG CCATCCCTCA AGTCGGTGTC CATTCCGAAG TCGGCAAGCT GCGCAAGGTG 
CTCGTCTGCT CGCCGGGCCT CGCGCATCAG CGCCTCACGC CGAGCAACTG CGACGAGCTG
CTGTTCGACG ACGTGATGTG GGTGAACCAG GCGAAGCGCG ATCACTTCGA TTTCGTCTCG
AAGATGCGCG AGCGCGGCGT CGAGGTGCTC GAGATGCACA ACCTGCTGAC CGAGACCGTG
CAGAACCCGG CCGCGCTCAA GTGGATCCTC GATCGCAAGA TCACGCCGGA TAACGTCGGC
ATCGGCCTCG TCGACGAAGT GCGCGCGTGG CTCGAGGGCC TCGAGCCGCG TGCGCTCGCC
GAGTTCCTGA TCGGCGGGGT CGCGGCGAGC GACATTGCCG GCGCCGAGCG CTCGAAGGTG
CTCACGCTGT TTCGCGACTA TCTCGGCAAG TCGTCGTTCG TGCTGCCGCC GCTGCCGAAC
ATGATGTTCA CGCGCGACAC GTCGTGCTGG ATCTACGGCG GCGTCACGCT CAACCCGATG
CACTGGCCCG CGCGCCGGCA GGAGACGCTC CTCGTCGCCG CCGTCTACAA ATTCCACCCG
GCGTTCACCG ACGCGAAGTT CGACGTCTGG TACGGCGATC CCGACCGCGA TCACGGCATG
GCGACGCTCG AAGGCGGCGA CGTGATGCCG ATCGGCCGCG GCGTCGTGCT CGTCGGCATG
GGCGAGCGCA CGTCGCGCCA GGCGGTCGGC CAGCTCGCGC AGGCGCTGTT CGCCAAGGGC
GCGGCCGAGC GCGTGATCGT CGCCGGGCTG CCGAACTCGC GCGCGTCGAT GCACCTCGAC
ACCGTGTTCA GTTTCTGCGA CCGCGATCTC GTCACGGTCT TCCCCGAAGT CGTGAACCGG
ATCGTGCCGT TCACGCTGCG CCCGGGCGGC GATGCGCGTT ACGGCATCGA CATCGAGCGC
GAGGACAAGC CGTTCGTCGA CGTCGTCGCG CAGGCGCTCG GCCTCAAATC GCTGCGCGTC
GTCGAGACGG GCGGCAACGA TTTCGCGGCC GAACGCGAGC AATGGGACGA CGGCAACAAC
ATGGTGTGCA TCGAGCCGGG CGTCGTCGTC GGCTACGACC GCAACACGTA CACGAACACG
CTGCTGCGCA AGGCGGGCGT CGAGGTGATC ACGATCGGCT CGAGCGAGCT CGGCCGCGGC
CGAGGCGGCG GCCACTGCAT GACCTGCCCG GTGCTGCGCG ACCCCGTCGA CTACTGA
 
Protein sequence
MSQAIPQVGV HSEVGKLRKV LVCSPGLAHQ RLTPSNCDEL LFDDVMWVNQ AKRDHFDFVS 
KMRERGVEVL EMHNLLTETV QNPAALKWIL DRKITPDNVG IGLVDEVRAW LEGLEPRALA
EFLIGGVAAS DIAGAERSKV LTLFRDYLGK SSFVLPPLPN MMFTRDTSCW IYGGVTLNPM
HWPARRQETL LVAAVYKFHP AFTDAKFDVW YGDPDRDHGM ATLEGGDVMP IGRGVVLVGM
GERTSRQAVG QLAQALFAKG AAERVIVAGL PNSRASMHLD TVFSFCDRDL VTVFPEVVNR
IVPFTLRPGG DARYGIDIER EDKPFVDVVA QALGLKSLRV VETGGNDFAA EREQWDDGNN
MVCIEPGVVV GYDRNTYTNT LLRKAGVEVI TIGSSELGRG RGGGHCMTCP VLRDPVDY