Gene BURPS1106A_2042 details

Gene Information

Locus tag: BURPS1106A_2042
Symbol:
ID: 4902932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2024691
End bp: 2025782
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 68%
IMG OID: 640135272
Product: hypothetical protein
Protein accession: YP_001066307
Protein GI: 126451937
COG category: [S] Function unknown
COG ID: [COG3535] Uncharacterized conserved protein
TIGRFAM ID:
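
The Start bp, End bp, Gene Length and Protein Length entries above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information entries above.
START_BP = 2024691
END_BP = 2025782
REPORTED_GENE_LENGTH = 1092      # bp
REPORTED_PROTEIN_LENGTH = 363    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1092 363
```

Under these assumptions the computed values agree with the reported 1092 bp and 363 aa.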


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.17375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGACGCA TCCTGTCTTC GAAGGACGTT GAAGCCGCCG TCAAGGGCGG CTCGGTTTTC 
GCATGCGGCG GCGGCGGCTG GGCCGACCAC GGCCGCGAAC TCGGCATGCT CGCGGTCACG
ATCGGCCGCC CCGAACTGGT CGCCATCGAC GAATTGCCGG ACGACGCATG GATCGCCACG
GCCGCCGCAA TCGGCGCGCC GGGCGGCCTC ACCGACTGGC AAATGCTCGG CGCCGACTAC
GTGAAGGCCG CTCAGCTCGT GCAGGAAGCG CTCGGCGCAC CGCTCGCGGG GCTCATCATC
GGGCAAAACG GCATGTCGAG CACGCTTAAC GCGTGGCTGC CGTCCGCGCT GCTCGGCGCC
AAGGTCGTCG ACGCGGTCGC CGATCTGCGC GCCCATCCGA CCGGCGACAT GGGCTCGCTC
GGTCTCGCGT CGAGCTCCGA ACCGATGATC CAGGCAGCCG CCGGAGGCAA CCGCGCGAAG
CATGCGTACA TGGAAGTCGT CGTGCGCGGC GCGACCGCCA AGGTATCGCC GGTATTGCGC
AAGGCCGCCG ACATGGCCGG CGGCTTCATC GCGAGCTGCC GCAACCCCAT CCGCGCATCG
TACGTGCGCC GGCATGCGGC GCTCGGCGGC ATCAGTCGCG CGCTCGCGCT CGGCGAAGCA
ATCATCGACG CCGAGCGGCG CGGCGGCAGC GCGGTGATCG ATGCGATCTG CGCAGCCACG
CAAGGCGAGA TCATCGTGAG CGGCAAAGTC GAGCGCAATA CGCTCGCCTA CACGCGCGAG
GCGTTCGACG TCGGACTCGT CTATCTCGGC GAGGGCGCCA AGCGCGCGGT CATTCATGTG
ATGAACGAAC ACATGGCGGT AGACGACGCG CACGGCGAGC GGATCGCGAC CTACCCCGAC
GTGATCACGA CGCTCGACAG CGACGGCCGC CCTGTCAGCG CCGGGCAGTT AAAGGAAGGG
ATGGAGATTC ACGTGCTGCG GGTGACGAAG ACACACATTC CGCTGTCGTC GTCGGTGTTC
GATCCCGCGA TCTACCCGCC GGTCGAAACC GCGCTCGGCA TCTCGATCGC CGACTATGCG
CTCGCCCGCT GA
 
Protein sequence
MGRILSSKDV EAAVKGGSVF ACGGGGWADH GRELGMLAVT IGRPELVAID ELPDDAWIAT 
AAAIGAPGGL TDWQMLGADY VKAAQLVQEA LGAPLAGLII GQNGMSSTLN AWLPSALLGA
KVVDAVADLR AHPTGDMGSL GLASSSEPMI QAAAGGNRAK HAYMEVVVRG ATAKVSPVLR
KAADMAGGFI ASCRNPIRAS YVRRHAALGG ISRALALGEA IIDAERRGGS AVIDAICAAT
QGEIIVSGKV ERNTLAYTRE AFDVGLVYLG EGAKRAVIHV MNEHMAVDDA HGERIATYPD
VITTLDSDGR PVSAGQLKEG MEIHVLRVTK THIPLSSSVF DPAIYPPVET ALGISIADYA
LAR
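
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The sketch below is illustrative only: the helper names are hypothetical, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). Only the first line of the gene sequence is embedded to keep the example short; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
FIRST_LINE = "ATGGGACGCA TCCTGTCTTC GAAGGACGTT GAAGCCGCCG TCAAGGGCGG CTCGGTTTTC"

print(translate(FIRST_LINE))  # MGRILSSKDVEAAVKGGSVF
print(f"GC fraction of first line: {gc_fraction(FIRST_LINE):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence. Running `gc_fraction` on the full gene sequence should give a value comparable to the reported 68%, assuming that figure was computed over the same 1092 bp.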