Gene BURPS668_A2456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2456 
SymbolssuD 
ID4888071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2374014 
End bp2375309 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content67% 
IMG OID640132393 
Productnitrilotriacetate monooxygenase component A 
Protein accessionYP_001063450 
Protein GI126442459 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00745552 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGAGC ACCGTCAGAT GCACCTGAAC TGCCATATCG TCGGCGTCGG CCAGCATCCG 
GCCGGATGGC GCACGCTGCG CAACACGCGT GCGATCGTCG ATCCGGATTT CTATCGGCGC
GTCGCGCGCG TCGCCGAGCA GGGCAAGTTC GACGCGCTCT TCTTCTCCGA TTCGCTTTCG
CTGCACGGCC ACGCTCATGG GCCGTCGCAG ATGCTCGATC CGCTCGTCGT CGCGTCGGCG
CTGGCGCTCG TCACCGAACG CATCGGGCTG ATCTGCACGG CGTCCACGAC GTTCAGCGAT
CCGTTTTCCC TCGCGCGCCG CTTCCTGTCG CTCGATCAGA TGAGCGGCGG CAGGGCGGGC
TGGAACGCGG TCACCACCTA CAATCCGGCC GCGGCCGCGA ATTTCGGCCC GGTGGCGTTG
CCGAGCGCGG CCGAGCGCTA CGCCCGCGCG GACGAGTTCA TCGACGTGAC GATCAAGCTG
TGGGAAAGCT GGGGCGAGCG CGCGCTCGTC GCCGATGCGG CGAGCGGCGT GTTCGCCGAT
CCCGCGCAGG TCCGGCGCAT CGATCATCAC GGCCGGCACT TCGACATCGC CGGGCCGCTG
AACGTGCCGC GCAGCCCGCA GGGGCGGCCG CTGCTCGTGC AGGCCGGCGC GTCCGAAGCC
GGCCTCGAGC TGGCGGCGAG GCATGCGGAC ATGGTGTTTA CGTCGCAGCA TTCGCTCGAA
GGCGCGCAGC GCTTCTACGC GGATCTCAAG TCGAGAGTGC ACGCGAAAGG AAGGGATCCC
GATCAATTCG GAATATTGCC GGGGTTGTAT CCGGTCATCG GCTCGACGAT GGCCGAGGCG
TGTGCGAGGA AGGACGAAAT GGACGCGTTG CGCGATCCGG GCGGCGACAT CGAGATGCTC
GCGCGCCAGT TCGGCATCCG GCCGGAGGAC CTCGCGCTCG ACGCGCCGCT GCCCTACGAG
CGGATCGCGC GCGCGTCGCG GGACCATGTG TCGCACGGCT TCGCCGCGCA GATGATCGGC
TTCGCGCGCG GGAAAAACCT GACGGTTCGC GAACTGATCG ACCACAACCT CGGCATGCAC
CGGATCGTCG TCGGCACGCC GGAGTCGATC GCCGACGATA TGACGCAATG GTTCGAGCAG
GGGGCGGCCG ACGGCTTCAA CCTGAACTTC GACGTATTCC CCGACGGGCT GCAGACGATG
GTCGAGCACG TCGTGCCGTT GCTGCAGAAG CGCGGGCTGT TCCGATCGGA TTATTCGGCG
CGTACGCTGA AGGGCCATCT CGGCGTGCGC AGTTGA
 
Protein sequence
MHEHRQMHLN CHIVGVGQHP AGWRTLRNTR AIVDPDFYRR VARVAEQGKF DALFFSDSLS 
LHGHAHGPSQ MLDPLVVASA LALVTERIGL ICTASTTFSD PFSLARRFLS LDQMSGGRAG
WNAVTTYNPA AAANFGPVAL PSAAERYARA DEFIDVTIKL WESWGERALV ADAASGVFAD
PAQVRRIDHH GRHFDIAGPL NVPRSPQGRP LLVQAGASEA GLELAARHAD MVFTSQHSLE
GAQRFYADLK SRVHAKGRDP DQFGILPGLY PVIGSTMAEA CARKDEMDAL RDPGGDIEML
ARQFGIRPED LALDAPLPYE RIARASRDHV SHGFAAQMIG FARGKNLTVR ELIDHNLGMH
RIVVGTPESI ADDMTQWFEQ GAADGFNLNF DVFPDGLQTM VEHVVPLLQK RGLFRSDYSA
RTLKGHLGVR S