Gene BURPS1710b_1946 details


Gene Information

Locus tag: BURPS1710b_1946
Symbol:
ID: 3691533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 2119823
End bp: 2120833
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 73%
IMG OID: 637728402
Product: hypothetical protein
Protein accession: YP_333345
Protein GI: 76808991
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2064] Flp pilus assembly protein TadC
TIGRFAM ID:
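
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative only.
start_bp = 2119823
end_bp = 2120833

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1011 336
```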


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000366161
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCCCA GCCGCCTCGG CGCAATCGCG CTCGTTCTCG GCGCAATCGG CGTGCTGATG 
CTCGCCGCGC TCGCGATCAT GCAGGCCGTG CTCGCGCGGC GCACCGGCCG CACGCTCGCG
GACGCGCTCG ATCAGCGCGC CGCCGCGTTG GAGGCGGCCG CCGCGCGGGT CGCGGCGGGG
GCGGCCGGCG CGGCGCGCGC GGGCATGCCC GAGGCGGCGC CTGACGCGCG CCGTCCGCGC
TTCGCGGCGC TGCTCGATCG CGCGGGCCGG TTCGGAATGC GGCTGCTCGA TACGCGGCTC
GGCAAGCAGA TCGTCGCCGA CGAAGACCGG ATGCTGCTCG AACAGTGCGG CTACGTCGAC
GCGCACACGC GCGGCATCTT CCTGAGCGCG CGGATCGCGT GTGCGATCGC GCTGCCCGCC
GCCGTCGCGC TCGTCGGCGG CGAGCCGGTC CGCACGCATC TGGGCGCGTG GGTCGCGCTG
TCGGTGATCG CCGGCTTCAT GCTGCCGAAG ACCTACGTGC GCCGCCGCGC GGCGGCGCGC
CGCCAGTCCG TCGTCGACGA GATGCCGCTG CTCGTCGACA TGCTGCGGCT CTTGCAGGGC
GTCGGGCTGT CGCTCGACCA GAGCATCCAG GTCGTCACCA ACGACTTCAG GGGGATGCTG
CCCGTGCTGT CGTCGGAGCT CGGGATCGCG CAGCGGCAGT TCGTCGCGGG GCGCACGCGC
GAGCAGTCGC TGCAGCGTCT CGCGACGAGC TTCGACAACG AGGACCTGCG CGCGATCGTG
CGCCTGCTGA TCCAGGTCGA CAAGCACGGC GGCGCGGTGC AGGAGCCGCT CAAGCAGTTC
GGCGACCGGC TGCGCGAAGT GCGCCGCGCG ATGCTGCGCG AGCGCATCGG CCGCCTCACG
GTGAAAATGA CGGGCGTGAT GATTCTCACG CTGCTGCCCG CGCTGTTCAT CGTGACGGCG
GGGCCGGGGA TGCTCGCCGT CACGCATGCG CTCACGGCCG CGCGCCGCTA G
 
Protein sequence
MDPSRLGAIA LVLGAIGVLM LAALAIMQAV LARRTGRTLA DALDQRAAAL EAAAARVAAG 
AAGAARAGMP EAAPDARRPR FAALLDRAGR FGMRLLDTRL GKQIVADEDR MLLEQCGYVD
AHTRGIFLSA RIACAIALPA AVALVGGEPV RTHLGAWVAL SVIAGFMLPK TYVRRRAAAR
RQSVVDEMPL LVDMLRLLQG VGLSLDQSIQ VVTNDFRGML PVLSSELGIA QRQFVAGRTR
EQSLQRLATS FDNEDLRAIV RLLIQVDKHG GAVQEPLKQF GDRLREVRRA MLRERIGRLT
VKMTGVMILT LLPALFIVTA GPGMLAVTHA LTAARR
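
The relationship between the gene sequence, the GC content figure, and the protein sequence can be sketched in a few lines of Python. This is a hedged illustration, not the method the database uses: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. The example input is only the first 60 bases of the gene sequence above.

```python
# Standard codon-to-amino-acid assignments in TCAG order; translation
# table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence, in the 10-base blocks shown above.
first_line = "ATGGATCCCA GCCGCCTCGG CGCAATCGCG CTCGTTCTCG GCGCAATCGG CGTGCTGATG"
print(translate(first_line))  # MDPSRLGAIALVLGAIGVLM
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the GC content field.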