Gene Mkms_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3800 
Symbol 
ID4611735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4014151 
End bp4015863 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content68% 
IMG OID639793480 
Productvon Willebrand factor, type A 
Protein accessionYP_939783 
Protein GI119869831 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4548] Nitric oxide reductase activation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00674568 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACCGACG CCTCCACCCC GGGGAACCCA GACCGGTTCC GGTTCCTCGC CACCTACATC 
GCCGGCAGGT CGGTCGATGT CACCGAGGCA GCGGACGGCC AACCCGTGCA CACCGACGGC
CAGTTCATCT TCGTCTCGGC CGGAGGTTCG ATCGAGCAGC AACGCCGCGA GATGATGGTC
CAGGCCGCAC TTCTCGGTGC GGGCAGCCTC GATCCGCGAT TGGTGAAGGG ACTCCGGGCT
CGGGCCACGA CGGCGCGCCG TTACCTGACG CTCGAAGGAC AGCGCGCTCT CGCCCAACTC
GCGCAACGGA TTCCGCTCGA TGCCGCGCTG CTGGCCGACG GCCGACCCGC CACTGCGACC
GCCGAAGAGT CGCTCGAGGT GGCCAAGAGC CGAGCCAAGG TCGCCGACCC CCCGGAATGG
TTCGGCGTCA TCAAACCCTC CCGGCTGATG GCGGCTCCCG CCGGACCGGG TGGACAGGCC
ACCAACAAAG ACCTCAAACT GCAGTTCGAC CCCATCGACA TGCCCGAGTC GGATGACGAC
GACGAGGACC AGGACGATGA CGGCGGGAAG TCCGGGGACA GCAAGATCCT CAAGCTGTTC
GAGAGTCCGA TCTTCAACAA CCAGTCGATG TCGGACTACA TGCGAAAGAT GTTCGGCGGT
AAGCGTTCCG AAGGGGAGGG TGCCGCGGGC GCGGAGATGA CAGTTCGCTC CACACGGCGA
GTACAGGAGA TCGGCGCGAA TGCCCGTCCC CTGCCCACCC GGATCCAGTT CACCGACGAC
GGCAAACCCG GTGCGGCGTT GGGCGTGGGA GGCGCCCTCT ATCCGGAATG GGACGTCTTC
AACGACCGGT ACAAGCCGGA CTGGTGCCGG GTGATCAACT TCCCATTGAC GGTCGCCGCC
GACGTCTCGG ATGCCGGTGT CGCGCACGAC GACGTGTTGC GGCGCCGCCT GGCTCGCGTC
GGGCTCGGCC CAAAGGTGCT GCGCGGCCGC GCCGACGGCG ACGACCTCGA CATCGAGGCG
CTCATCGACC TGTTCGTCGA CCTGCAGTCC GGCTTCTCCG GTGCCGAACA CGTTTATCTG
GAGCGCCGTA AACTCGCCCG CAATCTCGGC GTGCTGATCC TCATCGACGC GTCCGGGTCG
GCCGTCGACG CCGATACGGA CGGCCTCGCG GTGCACGACC ACCAGCGACG GGCGGCCGCC
ACCTTGGCCG TCACCCTCGA GGAGCTCGGT GACCGCGTCG CCGTCTACGC ATTCCGGTCA
CAGGGCCGGC ACGCTGTGCA TCTGCCGGCC ATCAAGACGT TCGACCAGAG TTTCGGCGCC
GTCGGGCGGG CCCGGCTCAA CCAGCTCGAG CCGGCGAGCT ACACCCGCCT CGGGGCCGGA
ATCCGGGGTG CGGGCGAGGT TCTCAAGAAC GAGGCCGGCA CACCGAACCG ACTGCTGCTG
GTCCTCTCGG ACGGATTTCC CTACGACGAC GGCTACGAGG GCCGCTACGC GGAAGCCGAC
GCGCACAAGG CTCTCGAAGA GCTCCGCACC GAGGGCGTCG CCTGCCTGTG CCTGTCCATC
GGCGCCGCCA CGGAAACCGA TGTGCTCGAA CGCGTCTTCG GTTCCGCCAG CTTCGCCAGC
GCCGCAGATC TCTCCGAGTT GAGCCCCCAG ATGGACGAGT TGTTCATGTC CGCGCTCGCC
GAACTCGCCG CGCCGAAACC CGCGCGGGTG TGA
 
Protein sequence
MTDASTPGNP DRFRFLATYI AGRSVDVTEA ADGQPVHTDG QFIFVSAGGS IEQQRREMMV 
QAALLGAGSL DPRLVKGLRA RATTARRYLT LEGQRALAQL AQRIPLDAAL LADGRPATAT
AEESLEVAKS RAKVADPPEW FGVIKPSRLM AAPAGPGGQA TNKDLKLQFD PIDMPESDDD
DEDQDDDGGK SGDSKILKLF ESPIFNNQSM SDYMRKMFGG KRSEGEGAAG AEMTVRSTRR
VQEIGANARP LPTRIQFTDD GKPGAALGVG GALYPEWDVF NDRYKPDWCR VINFPLTVAA
DVSDAGVAHD DVLRRRLARV GLGPKVLRGR ADGDDLDIEA LIDLFVDLQS GFSGAEHVYL
ERRKLARNLG VLILIDASGS AVDADTDGLA VHDHQRRAAA TLAVTLEELG DRVAVYAFRS
QGRHAVHLPA IKTFDQSFGA VGRARLNQLE PASYTRLGAG IRGAGEVLKN EAGTPNRLLL
VLSDGFPYDD GYEGRYAEAD AHKALEELRT EGVACLCLSI GAATETDVLE RVFGSASFAS
AADLSELSPQ MDELFMSALA ELAAPKPARV