Gene BURPS668_A1216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1216 
SymbolcolA 
ID4885883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1158037 
End bp1159974 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content64% 
IMG OID640131155 
Productcollagenase 
Protein accessionYP_001062213 
Protein GI126444490 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000787421 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAGG TGTTCCGAAA AACCCGCCGC TGGTCCGCCG TGGCGGCGCT ATCGGCATTC 
GTGGGGCTGG CCGGCGCCGC GTCGGCCAAT ACGCAGCCGA TGCAACCGAC GCAACAAAAG
CAGGCGCGCA TGCCGCGCCT GCCGCAGAAC CTGCCGGTTT CGCCCGAGCA GGCCGAATAC
AATCTGCCGC TCAGCGAGCA GGATCGTGCG GCGCTCACCA GGCCTTCGCC GCTCAAGCAG
CCGGCCAAGC GCGGCAAACG CAGCGCGCCG GGCGCCGATT GCCGCGACAT GTCGGTGATG
ACTCAGTATC GCGGCGCCGC GCTCGCCGAT TACATCGCGA ATCTTCCCGA TTATGAATGC
CATTACGGCC TGTTCTCGGT CGATAAAACC CTGGCCGCGC AGATTTTCAG TGCGGAAAAT
GTGCATGCCG TCGCGAGCCG TTTCGTGCAG GATATCTATC GCTATGATGC GAGCAACTTG
ATTCTGGTCA ATTTGCTGAT TTATCTGCGT TCCGCTTATT ACCAATATGA TGTATCGGGC
ATTGCCAATC CGATTCCGAA TCTCGCGGTA TGGCTGCGCC CGTATATCAA GCAGAGCCTG
GAGGGCGCCG CGCTCTATCG AGAGAACGCG CGCGCGCCGA GCACCGCGAA CGAGCTGATG
AAGTTCATCA CGAACATGAA GGACGAGGCG TTCTATCTGC CCACGCTGAA GGCGCGCATT
GCGTTCTACA CGGCGAGCGC GACGAATCCG CAGGCGGCGG CGCCGCTGTT GCAGCCGAGC
GCGGCGGGCG GCTTCACCGG CCTGCTCACG GTGTTCTTCT ATGCGCATCA GCGCAGCGGC
GCGCAGCCGA TGCTCGATAG CGACGCGACG CTGCCCGAGA CGCTCAACCG CTTCGTCACC
GCGAACCGCG CGAGCCTGTC GAACACGAGC GCCGCGTACC AGCTCGCGGA CGCGGCGCGC
GAAACGTTTC GCTTCCTGCG CTACCCGGCG CAGAAGCCGC GCGTGAAGAA GATGATCCAG
GACATGCTCG CGTCGACGAG CATGACGGGC GCGGACAGCG ACCTGTGGCT CGCGGCGGCG
GAAGCGGTCG ACTATGGCGA TCCGGGCAAC TGCGCGGACT ACGGCACGTG CGACTACAAG
AAGCGGCTCA CCGACGCGGT GCTCACGCAT CGTTACGCGT GCAACGCGGG CGTGCGCATT
CTCGCGCAGG ACATGACGCT GCCGCAGTTG CAGTCGGTCT GCACGTCGGT CGCGCAGCAG
GACGACTACT TCCACCGGAT GATGAAGACC GGGCGCAAGC CGGTGGCGGG CGACCGCAAC
GATACGATCG AGCTCGTCAT CTTCGACGAC TACGCGAACT ATCGAAAATA TGCTTCGGTG
ATCTACGGCA TCAGCACCGA CAACGGCGGC ATGTATCTCG AAGGCGATCC GTCCGCGCCC
GGCAACCAGG CGCGCTTCAT TGCGCACGAG GCGTCGTGGT TGCGGCCCGA GTTCAAGGTC
TGGAACCTCG AGCACGAGTT CACGCACTAT CTCGACGGCC GCTACGACAT GGCGGGCGAT
TTCGCGGCGA GCACCGCGAA GCCGACCGTC TGGTGGATCG AGGGTCTCGC CGAATATCTG
TCGAGAAAGA ACGACAATCA GGAGTCGATC GATGCGGCGC GCACGGGCGC GTACCGCTTC
TCGGACGTGC TCGGCACGCT GTATTCGTCG AGCGACTACG TCGCGCGCGC CTACCGTTGG
GGCTACATGG CGACACGCTT CATGTTCGAG CGCCATCGCG CGGACGTGGA CACGATCGTG
TCGCGCTTCC GGGTGGGCGA CTACGACGGC TACGCGAACT ACGTCGCGTA CATCGGCAAC
CGCTACGACG GCGAGTTCGT CGATTGGGCG CGCGCGGCGA CCACGGCGGG CGAGCCGCCG
CTGCCGACGA AGCGTTGA
 
Protein sequence
MTEVFRKTRR WSAVAALSAF VGLAGAASAN TQPMQPTQQK QARMPRLPQN LPVSPEQAEY 
NLPLSEQDRA ALTRPSPLKQ PAKRGKRSAP GADCRDMSVM TQYRGAALAD YIANLPDYEC
HYGLFSVDKT LAAQIFSAEN VHAVASRFVQ DIYRYDASNL ILVNLLIYLR SAYYQYDVSG
IANPIPNLAV WLRPYIKQSL EGAALYRENA RAPSTANELM KFITNMKDEA FYLPTLKARI
AFYTASATNP QAAAPLLQPS AAGGFTGLLT VFFYAHQRSG AQPMLDSDAT LPETLNRFVT
ANRASLSNTS AAYQLADAAR ETFRFLRYPA QKPRVKKMIQ DMLASTSMTG ADSDLWLAAA
EAVDYGDPGN CADYGTCDYK KRLTDAVLTH RYACNAGVRI LAQDMTLPQL QSVCTSVAQQ
DDYFHRMMKT GRKPVAGDRN DTIELVIFDD YANYRKYASV IYGISTDNGG MYLEGDPSAP
GNQARFIAHE ASWLRPEFKV WNLEHEFTHY LDGRYDMAGD FAASTAKPTV WWIEGLAEYL
SRKNDNQESI DAARTGAYRF SDVLGTLYSS SDYVARAYRW GYMATRFMFE RHRADVDTIV
SRFRVGDYDG YANYVAYIGN RYDGEFVDWA RAATTAGEPP LPTKR