Gene BURPS1710b_2842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2842 
SymbolastB 
ID3691235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3151226 
End bp3152566 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID637729298 
Productsuccinylarginine dihydrolase 
Protein accessionYP_334226 
Protein GI76810086 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3724] Succinylarginine dihydrolase 
TIGRFAM ID[TIGR03241] succinylarginine dihydrolase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCTA AAGAAGCCAA TTTCGACGGG CTCGTCGGCC CGACCCATAA CTACGCGGGA 
TTGTCGTTCG GCAACGTCGC GTCGCTGTCG AACGAAAAGT CCGACGCGAA CCCGAAGGCG
GCCGCCAAGC AGGGGCTGCG CAAGATGAAG CAGCTCGCGG ACCTCGGTTT CGCGCAGGGC
GTGCTGCCGC CGCAGGAGCG GCCGTCGCTG CGCCTGTTGC GCGAGCTCGG CTTCTCCGGC
AAGGACGCCG ACGTGATCGC GAAGGCCGCG AGGCAGGCGC CCGAGCTGCT CGCCGCCGCG
AGCTCCGCAT CGGCGATGTG GACCGCGAAC GCGGCGACGG TGAGCCCGTC CGCCGATACG
AGCGACGCCC GCGTGCATTT CACGCCGGCG AACCTGTGCA GCAAGCTGCA TCGCGCGATC
GAGCACGAAT CGACGCGCCG CACGCTCGCC GCGATCTTCG CGGACGAAGC GCGCTTCGCG
GTGCACGACG CGCTGCCCGG CACGCCCGCG CTCGGCGACG AGGGCGCGGC GAACCATACG
CGCTTTTGCG CGGAGTACGG CGCGCCCGGC GTCGAGTTCT TCGTGTACGG CCGCGCCGAA
TACCGCCGCG GGCCGGAGCC GACGCGTTTT CCGGCGCGCC AGACGTTCGA GGCGAGCCGC
GCGGTCGCGC ATCGCCACGG CCTGCGCGAG GAAGCGACGA TCTACGCGCA GCAGCGCCCG
GACGTGATCG ACGCGGGCGT GTTCCACAAC GACGTGATCG CGGTCGGCAA TCGCGACACG
CTGTTCTGCC ACGAACATGC GTTCGTCGAC CGGCAGGCGG TGTACGACGC GCTCGCCGCG
TCGCTCGGCG CGCTCGGCGC GCAGTTGAAC GTGATCGAGG TGCCGGATCG CGCGGTGAGC
GTCGCCGACG CGGTGGGCTC GTACCTGTTC AACAGCCAGC TGCTCGCGCG CGAAGACGGC
ACGCAGATGC TGGTCGTGCC GCAGGAATGC CGCGAGAACG CGAACGTGGC CGCGTATCTC
GACGCGCTCG TCGCCGGCAA CGGGCCGATT CGCGACGTGC GCGTGTTCGA TCTGCGCGAG
AGCATGAAGA ACGGCGGCGG GCCCGCGTGC CTGCGGCTGC GTGTCGTGCT GAACGATGCC
GAGCGCGCGG CGGTGAAGCC GAATGTGTGG ATCGGCGACG CGCTGTTCGC ATCGCTCGAC
GCATGGATCG ACAAGCATTA CCGCGACCGG CTGTCGCCCG TCGATCTCGC CGACCCCGCG
CTGCTCGACG AATCGCGCAC CGCGCTCGAC GAATTGACGC AGATCCTCGG CCTCGGCTCG
CTCTATGACT TCCAGCGCTG A
 
Protein sequence
MNAKEANFDG LVGPTHNYAG LSFGNVASLS NEKSDANPKA AAKQGLRKMK QLADLGFAQG 
VLPPQERPSL RLLRELGFSG KDADVIAKAA RQAPELLAAA SSASAMWTAN AATVSPSADT
SDARVHFTPA NLCSKLHRAI EHESTRRTLA AIFADEARFA VHDALPGTPA LGDEGAANHT
RFCAEYGAPG VEFFVYGRAE YRRGPEPTRF PARQTFEASR AVAHRHGLRE EATIYAQQRP
DVIDAGVFHN DVIAVGNRDT LFCHEHAFVD RQAVYDALAA SLGALGAQLN VIEVPDRAVS
VADAVGSYLF NSQLLAREDG TQMLVVPQEC RENANVAAYL DALVAGNGPI RDVRVFDLRE
SMKNGGGPAC LRLRVVLNDA ERAAVKPNVW IGDALFASLD AWIDKHYRDR LSPVDLADPA
LLDESRTALD ELTQILGLGS LYDFQR