Gene BURPS1710b_2790 details

Gene Information

Locus tag: BURPS1710b_2790
Symbol: hutG
ID: 3691558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 3095650
End bp: 3096684
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 71%
IMG OID: 637729246
Product: N-formylglutamate amidohydrolase
Protein accession: YP_334174
Protein GI: 76809406
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3741] N-formylglutamate amidohydrolase
TIGRFAM ID: [TIGR02017] N-formylglutamate amidohydrolase
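
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp, end_bp = 3095650, 3096684

# Inclusive coordinates: length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the two values agree with the Gene Length and Protein Length fields of the record.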


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGCGAAC GCCGGACGCT ACGAACGGAC GGCGGCGCGG ATGGCCTGCG GCGCGGTCAG 
CATGAATCCG GCGCGCGGCG TCGAGCGCTA CGGCGGGCCG CCGCTCGCGC CTCGGCGCAA
TGCCCGCCCG TCGAGCGTTT CGCGCGCACG ATACCGCGGC CGGCCGCCGC GTGCACCATG
GAGCCGCCGG CGGGCCGGGG CGCGCCGCAT CATCAACGAG GTCGAACGAT CATGAACACC
GCATCGCAAC CGCCGGTATT CACGCTGCAT CGCGGCACGC TGCCGCTGCT CGTGTCGATA
CCGCACGCGG GCACTCACCT TCCCGATGAC ATCGCCGCGA CGATGACGCC CGTCGCTCGC
CACGTCGACG ATTGCGACTG GCATCTCGAG CGTCTGTACG ATTTCGCGAA GACGCTCGGC
GCGTCGGTGC TCGTGCCTTC GCACGCGCGC TACGTCGTCG ATCTGAACCG CCCGCCGGAT
GACGCGAATC TCTACCCGGG GCAGGACACG ACGGGCCTCG TGCCGGTCGA CACGTTCGAC
AGGGCGCCGC TGTACGCGCA CGGCCACGAG CCGACCGTCA CCGAGATCGC GCGCCGCCGC
GAACGCTATT GGGCGCCGTA TCACGGCGCG CTCGCGGGCG AACTGCAACG GCTGAAGGAC
GCGCACGGCC GCGCGCTGCT GTGGGAGGCG CATTCGATCC GCTCGCACGT GCCGCGCTTC
TTCGACGGCC GGCTGCCCGA CTTCAACTTC GGCACGTCGA GCGGCGCGAG CGCCGCGCCC
GGGCTCGCCG ACAAGCTGGC CGCGCTCGTC GACCGGCACG GCGGCTATAC GGCGATCGCG
AACGGGCGCT TCAAGGGCGG CTACATCACG CGTCACTACG GCGCGCCGGA GCAGGGCGTG
CAGGCGGTGC AGCTCGAACT CGTGCAGGCG ACCTACATGG ACGAGACGCG GCCTTATTCG
TACGACGAAA CCAGGGCGCG GCGGATCGCG CCGCTGCTCG AAGCGCTCGT GAGCGCCGCG
CTCGAGCATC ATTGA
 
Protein sequence
MRERRTLRTD GGADGLRRGQ HESGARRRAL RRAAARASAQ CPPVERFART IPRPAAACTM 
EPPAGRGAPH HQRGRTIMNT ASQPPVFTLH RGTLPLLVSI PHAGTHLPDD IAATMTPVAR
HVDDCDWHLE RLYDFAKTLG ASVLVPSHAR YVVDLNRPPD DANLYPGQDT TGLVPVDTFD
RAPLYAHGHE PTVTEIARRR ERYWAPYHGA LAGELQRLKD AHGRALLWEA HSIRSHVPRF
FDGRLPDFNF GTSSGASAAP GLADKLAALV DRHGGYTAIA NGRFKGGYIT RHYGAPEQGV
QAVQLELVQA TYMDETRPYS YDETRARRIA PLLEALVSAA LEHH