Gene Mmcs_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4038 
Symbol 
ID4112868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4310129 
End bp4311412 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content67% 
IMG OID638033181 
Producthypothetical protein 
Protein accessionYP_641199 
Protein GI108801002 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.114172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATCG CAGCCCGTTC GTATCTCTCC GCCGGCGTGG CGCTGGTCGG AGCCGGAGCG 
ATCGCGATCA GCCCGATTGC GCCGCCGTTG CCCGATGCGA AGGTGCCGAC GCTGTCGACG
ATCGGCGTCG AGCTCAACGC CGCCGTCAAC CCGATCCAGA CCTGGATCGA GGTCTTCGGC
CAGGCGGCGG AGAACCTCGC GGTGCTCGGG CAGACGGTGG CGCAAAACCC GGCACCGATT
CTGCAGCAGG TGTTCGAGAA TCAACTCGCG AACATCGAGA CGCTGGCGCC GGCGCTCGAG
GAGTTCTTCA CCGGGGTCGT CGCCTCACTC GATCCGACCA ACCCCGACGG AATCCCGGCA
AATCTACGGG AGGCGTTCGA ACTGATCGTC GGCGGTAACC CGGCCGAGGG GATCCCGGCG
ATCATGCAGG CCTTCCTGCA ACCGATCCTC TTCCCGTCGC TCGCACTCCT GATGCCGGTA
CAGGGCATCA TCACGCAGAC GGCGCAGAAC GTCCTCGACG TCGCCAATCA GGCGCTCACG
GTTTTCGCGG TCGGCGCGAT CGCAGTGCTC ACCCCGGTGT TCAGCGGGTT CAATGCGGCC
GGAGCCGCAG CACAGGCCGT GGTCGACGCC GCCAACGTCG GAGACATCGC GGGCGTGATC
ACCGCGGTGC TCGACATCCC GGGCGTCGTG GTCGGCGGTG TGCTGAACGG TTTCGGGTTC
GACGGCGGGC TTCTCACCCC GGAATCCGGC ACTGTCGCTG CGTTTTTGAC CCTGCGCCAG
ATGATCGCCG ACGCGCTCAA GCCCGATCTG CCGGCGTTGC GCGTCGCCGA CGTCTCGTCG
ACGAAGACGG AGGCCACCGA GACGGTGACC CTGGACCTCA CCGATTCGGG CGTGACTCCG
GCCGCGTTCA AGACCGAGGT AGCCGGTGGC GAGGAGGCTC AGAGCCCGGG CGCCGAGCCG
GGCGACGGTG TTCCCGTCGT GGACGAGAAG TCCGGCGATG AGGTCGTCGA CGAGACGCCT
GCCGAGGAGG CTCCCCAGGA GGAAGCGCCC GACGAGGATG CTCCCGAAGA AGAGGAAGCC
CTCGACGAGG AGACGCTCGA GGACACGCTC GAGGAGGACG CATCCGAAGA GGAGACCGGA
GGCGACGACA CCACCGAGCA GGAACCGTCC GAAGAACAGA ACGCGAACGA TGACAACGCC
AACGACGACA ACGCGAACGA CCCCGGTGAG AAGGAGACCG GAACCGAGAA CACCGGCGGC
GACGCCGGAG GCGCCGAGGC CTGA
 
Protein sequence
MQIAARSYLS AGVALVGAGA IAISPIAPPL PDAKVPTLST IGVELNAAVN PIQTWIEVFG 
QAAENLAVLG QTVAQNPAPI LQQVFENQLA NIETLAPALE EFFTGVVASL DPTNPDGIPA
NLREAFELIV GGNPAEGIPA IMQAFLQPIL FPSLALLMPV QGIITQTAQN VLDVANQALT
VFAVGAIAVL TPVFSGFNAA GAAAQAVVDA ANVGDIAGVI TAVLDIPGVV VGGVLNGFGF
DGGLLTPESG TVAAFLTLRQ MIADALKPDL PALRVADVSS TKTEATETVT LDLTDSGVTP
AAFKTEVAGG EEAQSPGAEP GDGVPVVDEK SGDEVVDETP AEEAPQEEAP DEDAPEEEEA
LDEETLEDTL EEDASEEETG GDDTTEQEPS EEQNANDDNA NDDNANDPGE KETGTENTGG
DAGGAEA