Gene BamMC406_4007 details


Gene Information

Locus tag: BamMC406_4007
Symbol:
ID: 6179716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia ambifaria MC40-6
Kingdom: Bacteria
Replicon accession: NC_010552
Strand:
Start bp: 1019894
End bp: 1020952
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 71%
IMG OID: 641683777
Product: transglutaminase domain-containing protein
Protein accession: YP_001810688
Protein GI: 172063037
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
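
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1019894
end_bp = 1020952
gene_length_bp = 1059
protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)  # 1059 353 0 352
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.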


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.188369
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGACGA CGAAACACGC GGCGAAGATG TCGCCCGCGC CGGGTTCCGG CAGCGCGAAG 
GACGCGCCGG CGGCGACGCC AGCAACACCC GCCGCGCCGA CCACCGTCGC GCCCGGCCGC
GTGCTGCGCG TGACGCACGA CACCGAATAC CGCTACGCGG CGCGCGTCGA ATCCGCGCAG
CACCAGGCGC GGCTGCAGCC GCTCGCCACG CCGCGCCAGC AGGTGCTGTC GTTCGCGCTC
GACATCGAGC CCGCCGCGGA GTCGGTCTGC ACCGAGATCG ATGCGTTCGG CAACGCGCGC
GCGTCGTTCG CGCTGAACCA GCCTCACGAT GCGCTGCTCG TGCGCAGCCG CAGCACGGTG
CGCGTGAGCG CGCCTGTATG GTCGCTAGGG GCACGCGGCG CACCGCCGCC CGCGATCGCG
AGCCCCGACG CCGAACATGC GATGGCGTGG GAAGCCGTGC GCGAGCGCTT GCAGTTTCGC
GCGGGACAGC CGTACGACGC GGCGAGCGAG TTCGTGTTTG CGTCGCCGCA CGTGGCGTGC
GATCCCGAAC TGGCCGCGTA TGCGGCCGCG AGCTTCACGC CGGGGCGGCC GCTCGTGCAG
GCCGCGTGGG ACCTGATGCG GCGAGTTCAC GCGGATTTCG CGTATACGCC GAACAGCACC
GACATCACGA CGACCGCGCT CGATGCGTTG CGATTGCGCA AGGGCGTCTG CCAGGATTTC
GCGCACGTGA TGATCGGCGC GCTGCGCTCG CTCGGACTCG CCGCGCGCTA TGTGAGCGGG
TATCTGCTGA CGCAGCCGCC GCCCGGGCAG CCGCGGCTGA TCGGCGCGGA TGCGTCGCAT
GCATGGATCG ACGTGTATGA CCCGGCATGG CCCGAAGATG GCGGCTGGCT GCAACTCGAC
CCCACCAACG ACCGTGCGCC GGGTGACGAC TACGTGATGC TGTCGATCGG CCGCGACTAT
GCAGACGTGA CGCCGTTGCG CGGCGTGATA CGCGGCGGCG GCGCGGATCA GGAGCTGAAG
GTCGGCGTGA CGGTCGAGCC GCTCGACGAA GCGTCGTGA
 
Protein sequence
MVTTKHAAKM SPAPGSGSAK DAPAATPATP AAPTTVAPGR VLRVTHDTEY RYAARVESAQ 
HQARLQPLAT PRQQVLSFAL DIEPAAESVC TEIDAFGNAR ASFALNQPHD ALLVRSRSTV
RVSAPVWSLG ARGAPPPAIA SPDAEHAMAW EAVRERLQFR AGQPYDAASE FVFASPHVAC
DPELAAYAAA SFTPGRPLVQ AAWDLMRRVH ADFAYTPNST DITTTALDAL RLRKGVCQDF
AHVMIGALRS LGLAARYVSG YLLTQPPPGQ PRLIGADASH AWIDVYDPAW PEDGGWLQLD
PTNDRAPGDD YVMLSIGRDY ADVTPLRGVI RGGGADQELK VGVTVEPLDE AS
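
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own pipeline. It builds the codon table from the standard NCBI base ordering (T, C, A, G), whose amino acid assignments are the same for translation tables 1 and 11. The `translate` helper is a hypothetical name, and the sketch does not handle alternative start codons, which table 11 allows but this gene does not need because it begins with ATG.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G)
# paired with the amino acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace between the 10-base blocks of the record is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence above.
print(translate("ATGGTGACGA CGAAACACGC GGCGAAGATG"))  # MVTTKHAAKM

# Last nine bases of the gene sequence above, ending in the TGA stop codon.
print(translate("GCGTCGTGA"))  # AS
```

The two outputs agree with the first ten and last two residues of the protein sequence listed above.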