Gene BURPS1106A_3319 details

Gene Information

Locus tag: BURPS1106A_3319
Symbol: recN
ID: 4900625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3236269
End bp: 3237918
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 72%
IMG OID: 640136545
Product: DNA repair protein RecN
Protein accession: YP_001067556
Protein GI: 126455415
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
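
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon (assumptions that fit the values in this record but are not stated by it).

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3236269, 3237918)
print(length_bp)                  # 1650
print(protein_length(length_bp))  # 549
```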


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.303665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCGCC ACCTCTCGAT CCGCGATTTC GTCATCGTCG CCGCGCTCGA TCTCGAATTC 
GACAGCGGCT TCACCGTTTT CTCAGGCGAA ACGGGCGCCG GCAAATCGAT CCTGATCGAT
GCGCTCGCGC TCGCGCTCGG CGAGCGCGCC GACGCGAGCG TCGTGCGCAC CGGCAGCGGC
CGGGCCGACA TCAGCGCCGA ATTCACGCCG CACGACCGCG TCGCGCGCTG GCTCGACGAG
CACGCGTTCG ACGCCGACGA CACGGTGATG CTGCGGCGCG TCGTCGACGC GAACGGCCGC
TCGCGCGCCT TCATCAACGG CACGAGCGCG ACGCTCGCGC AACTGCGCGA AGTGGGCGAG
ATGCTCGTCG ACATCCACGG CCAGCACGCG CATCAGTTGC TGATGCGCGC GGACGCGCAG
CGCGAGCTGT TCGACACGCA CGCGGGGCTC GCGGCCGACG CGGCCGCCGT CGCGCGCGGC
TATCGCGCGT GGCGCGACGC GACGCACGCG ATCGACGCCG CACAGGCGCA CGAGCGCGAG
CGCCAGCTCG AACGCGAAAA GCTCGCGTGG CAGCTCGCCG AGCTCGACAA GCTCGCGCCG
CAGCCGGGCG AATGGGACGA GATCACCGCC GAGCACAAGC GGCTCACGCA TTCGGCGAAC
CTGATCGACG GCGTGCAGGG CGCGCTCGGC GCGATCTCCG AATCCGACGA CGCGATGCTC
ACGCAACTGG GCGCGATCGT GTCGAGGCTG AGGAGCCTCG CCGAATACGA CCCCGCGCTC
AACGACGCGC TCGCATCGCT CGAGCCGGCC GAGATCCAGC TGCAGGAGGC TTCGTATTCG
CTGTCGCACT ACGCGCAGCG GCTCGACCTC GACCCGGACC GGCTCGCGCA GGTCGAGACG
CGGCTCGACG CGCTGCACTC GACCGCGCGC AAGTTCCGGC TGCCGCCCGA GACGCTGCAC
GACGAGCACG AGGCGCGCCG CGCTCAGCTC GCCGAGCTCG ACGCCGCGGC CGATCTGAGC
GCGCTGCAGG CGGTTGCCGA CAAGGCGAAG CAGGCGTATC TGGCCGACGC GCAGAAGCTG
TCGAAGGCGC GCGCGCAAGC GGCGAAGGCG CTCGGCGTGG CGGTGACCAC CGGCATGCAG
GAATTGTCGA TGGCGGGCGG CAGCTTCGAG GTCGCGCTCG TGCCGCTCGC CGAAGGCGGC
GCGCACGGGC TCGAGCAGGT CGAGTTCCGC GTCGCGGGCC ATGCGGGCGT GCCGCTGCGG
CCGCTCGCGA AGGTCGCCTC GGGCGGCGAG CTCGCGCGGA TCAGCCTCGC GCTCGCGGTG
ATCGCGAGCG CGGCGAGCCC GACGCCGACG CTCATCTTCG ACGAAGTCGA CACGGGCATC
GGCGGCGGCG TCGCCGAGGT GGTCGGGCGG CTGCTGCATC AGCTCGGACA GATGCGGCAG
GTGCTGTGCG TCACGCACCT GCCGCAGGTC GCCGCGCGCG GCGACCATCA CTTTCAGGTC
GCGAAGGGCG AGGACGGCGA AGGCGGCACC GTGTCGACGG TCGTGCCGCT CGATCGCGCG
AGCCGGATCG AGGAAGTCGC GCGGATGCTG GGCGGCCTCG AGATCACCGC GACGACGCGC
AAGCATGCGA AGGAAATGCT CACCGCGTGA
 
Protein sequence
MLRHLSIRDF VIVAALDLEF DSGFTVFSGE TGAGKSILID ALALALGERA DASVVRTGSG 
RADISAEFTP HDRVARWLDE HAFDADDTVM LRRVVDANGR SRAFINGTSA TLAQLREVGE
MLVDIHGQHA HQLLMRADAQ RELFDTHAGL AADAAAVARG YRAWRDATHA IDAAQAHERE
RQLEREKLAW QLAELDKLAP QPGEWDEITA EHKRLTHSAN LIDGVQGALG AISESDDAML
TQLGAIVSRL RSLAEYDPAL NDALASLEPA EIQLQEASYS LSHYAQRLDL DPDRLAQVET
RLDALHSTAR KFRLPPETLH DEHEARRAQL AELDAAADLS ALQAVADKAK QAYLADAQKL
SKARAQAAKA LGVAVTTGMQ ELSMAGGSFE VALVPLAEGG AHGLEQVEFR VAGHAGVPLR
PLAKVASGGE LARISLALAV IASAASPTPT LIFDEVDTGI GGGVAEVVGR LLHQLGQMRQ
VLCVTHLPQV AARGDHHFQV AKGEDGEGGT VSTVVPLDRA SRIEEVARML GGLEITATTR
KHAKEMLTA
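
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11 and by computing its GC fraction. The following is a minimal sketch, not the database's own procedure. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first and last lines of the gene sequence are used as input, to keep the example short.

```python
from itertools import product

# NCBI translation table 11 uses the same codon-to-amino-acid assignments as
# the standard code; it differs only in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCTCCGCC ACCTCTCGAT CCGCGATTTC GTCATCGTCG CCGCGCTCGA TCTCGAATTC"
last_line = "AAGCATGCGA AGGAAATGCT CACCGCGTGA"

print(translate(first_line))  # MLRHLSIRDFVIVAALDLEF
print(translate(last_line))   # KHAKEMLTA*
```

Each full line of the gene sequence holds 60 bases, a multiple of 3, so translating line by line keeps the reading frame. Passing the complete 1650 bp sequence to `gc_fraction` would give a value to compare against the reported GC content.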