Gene Mvan_4008 details

Gene Information

Locus tag: Mvan_4008
Symbol:
ID: 4647464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4285723
End bp: 4288854
Gene Length: 3132 bp
Protein Length: 1043 aa
Translation table: 11
GC content: 66%
IMG OID: 639807470
Product: AMP-dependent synthetase and ligase
Protein accession: YP_954791
Protein GI: 120404962
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01733] amino acid adenylation domain; [TIGR01923] O-succinylbenzoate-CoA ligase
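
The start, end, and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(4285723, 4288854)
print(length)                     # 3132
print(protein_length_aa(length))  # 1043
```

Both values agree with the Gene Length and Protein Length fields of this record.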


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCG GGATCACACT CAGCGATGCC CTTGCCCGCC ATGCCGCGGT CAGGCCACAC 
GCATGCGCTC TCGCCGACCC GAGGCGACAC ACGACGTTCG GTGAGCTCGA CGAACGCGTG
ACCCGACTGG CGAGCGCGCT TGCGGCGCGC GGGGTTCGGT CCGGCGACCG GGTCGCCGTG
CTCGGGCTCA ACAGCATCGA GCTCGTCGAA TCCTGGCTGG CCGCGCATCG ACTGGGCGCC
ATCGCGGTGC CGGTGAACTT CCGGCTGGCC GCCGGCGAAA TCGGCTACGT GCTCTCCGAC
AGTGCAGCCA CCGCCATCGT CGTGGATGTG GCGCTGGAAT CCATGGTGGT ACAAGTCCGT
CAGCAGGTAC CGGCACTGCA CACCGTCGTG ACCATCGGCG GGAACCTGGA GCAGACCATC
GCCGCGGCAG ACCCCGATCT GCCGCAGTGC GCCGTCGCAG ACGACGCACC GGCCTTCATC
ATGTACACCT CGGGCACAAC CGGATTCCCG AAGGGCGCCG TGCTCACCCA CCGCAACCTC
TACCTGCACG CGTTCAGTTC GATCGCCACC CTCGGGCACC GCTCCGACGA CGACTGCTGG
ATGGCGGTGG CACCGCTGTT CCACACCGCC GGGGTCTCCG GCATGTTGCC GATGTTCCTG
ACCGGCGGCA AAACCGTCAT CCCACCGTCC GGCGGGTTCG ACCCAGACGC CACGATCGCC
GCCGTCGTCG ACGAGCAGGT CACGTCGTGC TGGATGACCC CGGCCCAGTG GCAGTCCGTC
TGTGCGTTAC CCGGCCTCGC CGCACACGAC CTGTCCCGGC TGCGCCGGGT GTGGTGGGGC
GCCGCCCCGG CATCGACGAC GTTGCTGCGC ACCATGATCG ACACGTTCAC CGGCGCCGAG
ATCATCGCCG CATTCGGCCA GACCGAGTGC AGCCCGATCA CCTGCCTGCT GCGCGGCGAG
GATGCGATCG CCAAGATCGG TTCGGTGGGC ACCCCGATGC TCAACGTCGA GGTGCGCGTG
GTGGACGACG AGATGAACGA CGTCGACCGG GGCGAGGTGG GCGAGATCGT GTATCTGGGG
CCGCTGGTCA TGAAGGAGTA CTGGAACAAG GCCGCCGAGA CCGCAGAGGC GTTCCGCGGC
GGATGGTTCC ATTCCGGTGA TCTCGTCCGA CAGGACGCCG ATGGGTACTT CTACGTCGTC
GATCGGAAGA AGGACATGAT CATCTCGGGC GGCGAGAACA TCTACAGCGC CGAGGTGGAG
AACGTCGTCG CGACGCATCC ATTGGTCGCC GAGGTCGCGG TCATCGGTGT GCCGCACCCG
AAATGGGGTG AGACCCCGGT CGCGGTGATC GTGCCGCGCG AGCCCACCGA TCCTCCGACC
GACGCCGAGA TCGAGGCGCA CTGCCGTGCG CAGCTCGCCT CGTACAAGCG ACCGAAGTAC
GTCACGCTGG TCGACGTGTT ACCACGCAAC GCCGCCGGCA AAGTCCTCAA GGGTCGACTG
CGGGACGAGC ACGCCACGCT GATCTCCTAC AGTGCGGGCC CCACCGACGC CGCGCTGCTG
GACGAGACGA TCGGCACGAA CTTCGAACGC ACCGTGTCGC GGTATCCCGA CAACGAGGCG
CTCGTCGACG TCCCGAGTGG ACGTCGGTGG ACATACGCCG AACTCAATGC CGAAATCGAC
TCATTGGCAC GCGCTTTGAT GGCCATCGGT ATCGAGAAGG GTGATCGCGT CGGGATCTGG
GCGCCCAACT GCCCGGAGTG GACGATGCTC CAGTACGCCA CCGCAAAGAT CGGCGCGATA
CTGGTCACGA TCAACCCCGC CTACCGCACC CACGAGCTGG CCTACGTGCT GCGGCATTCC
GCCGTCCGGC TGCTGGTGTC GGCGACCGAG TTCAAGACCT CCGACTACCG CGCCATGGTC
GCAGAGGTAC GGCCGGAGCT GCCCGGCCTG GCTGAAGTGC TGTTCCTGGC CACCGAGGAC
TGGGCACGGC TCGGCGAAAG GGCCGATCTG GTGTCCGAGG ACGAACTGCG ATGCCGGGTC
CGGAGTTTGA CCCCCGGCGA CGCCATCAAC ATCCAATACA CCTCAGGCAC AACGGGTTCA
CCCAAGGGCG CGACACTGTC ACACCGCAAC ATCCTCAACA ACGGCTACTT CGTCACCGAT
CTGATCGACT TCGGCCCCGG TGACCGGCTC TGCATACCGG TGCCGTTCTA CCACTGCTTC
GGCATGGTCA TGGGCAACCT CGGCTGCACG ACGCACGGCG CCACCATGGT GATTCCGGCG
GCGGGTTTCG ATCCTGCGGC GACGTTGGCC GCCATCGAGA AGGAGCACTG CACAGCGGTT
TACGGCGTGC CGACGATGTT CATCGCGATG CTGGGGCACC CCGACCTCGC CGACTGCGAC
GTGACGTCGC TGCGCACCGG GATCATGGCC GGCTCGCCGT GTCCGGTGGA GGTGATGAAA
CGTTGCGTCA ACGAATTGAA GATGTCGGAG GTCGGCATCG CTTACGGCAT GACGGAGACA
TCGCCGGTGT CCTGCCAGAC CCGGATCGAG GACGACCTCG ATCGGCGTAC CGCGACGGTC
GGCCGCGCGC ACCCACACGT GGAGATCAAG ATCGTCGACC CCGACACCGG CGAGATCGTC
AAACGCGGCA CGGCCGGCGA ATTCTGTACC CGGGGGTACT CGGTGATGCT CGGCTACTGG
GGTGACGAAG ACAGGACCAG GGAGGCTGTC GATGCCGACG GATGGATGCA CACCGGCGAT
CTGGCCGTGA TGCGCGACGA CGGGTACTGC ATGATCGTCG GCCGCATCAA AGACATGGTG
ATCCGCGGTG GCGAGAACGT CTACCCACGC GAAATCGAGG AGTTCCTGCA CACCCATCCC
GACATCGACG ATGTCCAGGT GATCGGTGTG CCCGACGAGC GTTACGGCGA GGAGATCTGC
GCCTGGATCA AGGTGCGGGC GGGCGCGGCA CCGCTGGACG CCCATGCCGT GCGCGAGTTC
GCTGCCGGGA AACTCGCGCA CTACAAGATC CCCCGCTACG TCCACATGAC CGACGACTTC
CCGATGACCG TCACCGGGAA GGTTCGCAAG ATCGACATGC GCGCCGAGAC GGTGCGGATC
CTCGGGCTGT GA
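
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any calculation such as GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as displayed in this record.
first_row = "ATGATCACCG GGATCACACT CAGCGATGCC CTTGCCCGCC ATGCCGCGGT CAGGCCACAC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of the first row only
```

Passing the full 3132 bp sequence through the same two functions would give a value to compare with the GC content field of the record (66%).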
 
Protein sequence
MITGITLSDA LARHAAVRPH ACALADPRRH TTFGELDERV TRLASALAAR GVRSGDRVAV 
LGLNSIELVE SWLAAHRLGA IAVPVNFRLA AGEIGYVLSD SAATAIVVDV ALESMVVQVR
QQVPALHTVV TIGGNLEQTI AAADPDLPQC AVADDAPAFI MYTSGTTGFP KGAVLTHRNL
YLHAFSSIAT LGHRSDDDCW MAVAPLFHTA GVSGMLPMFL TGGKTVIPPS GGFDPDATIA
AVVDEQVTSC WMTPAQWQSV CALPGLAAHD LSRLRRVWWG AAPASTTLLR TMIDTFTGAE
IIAAFGQTEC SPITCLLRGE DAIAKIGSVG TPMLNVEVRV VDDEMNDVDR GEVGEIVYLG
PLVMKEYWNK AAETAEAFRG GWFHSGDLVR QDADGYFYVV DRKKDMIISG GENIYSAEVE
NVVATHPLVA EVAVIGVPHP KWGETPVAVI VPREPTDPPT DAEIEAHCRA QLASYKRPKY
VTLVDVLPRN AAGKVLKGRL RDEHATLISY SAGPTDAALL DETIGTNFER TVSRYPDNEA
LVDVPSGRRW TYAELNAEID SLARALMAIG IEKGDRVGIW APNCPEWTML QYATAKIGAI
LVTINPAYRT HELAYVLRHS AVRLLVSATE FKTSDYRAMV AEVRPELPGL AEVLFLATED
WARLGERADL VSEDELRCRV RSLTPGDAIN IQYTSGTTGS PKGATLSHRN ILNNGYFVTD
LIDFGPGDRL CIPVPFYHCF GMVMGNLGCT THGATMVIPA AGFDPAATLA AIEKEHCTAV
YGVPTMFIAM LGHPDLADCD VTSLRTGIMA GSPCPVEVMK RCVNELKMSE VGIAYGMTET
SPVSCQTRIE DDLDRRTATV GRAHPHVEIK IVDPDTGEIV KRGTAGEFCT RGYSVMLGYW
GDEDRTREAV DADGWMHTGD LAVMRDDGYC MIVGRIKDMV IRGGENVYPR EIEEFLHTHP
DIDDVQVIGV PDERYGEEIC AWIKVRAGAA PLDAHAVREF AAGKLAHYKI PRYVHMTDDF
PMTVTGKVRK IDMRAETVRI LGL
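
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with
# the standard amino acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGATCACCG GGATCACACT CAGCGATGCC"))  # MITGITLSDA
print(translate("CTCGGGCTGT GA"))                     # LGL
```

The two outputs match the first ten and last three residues of the protein sequence listed above.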