Gene Mvan_2814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2814 
Symbol 
ID4646546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2980352 
End bp2981878 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content69% 
IMG OID639806295 
Productanthranilate synthase component I 
Protein accessionYP_953627 
Protein GI120403798 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0778236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGATGA CCGCCACCCT CAGCGCGACC ACCACGCGTG AGGACTTCCG GGCGCTGGCC 
GCCGAGCACC GTGTGGTCCC GGTGACCCGC AAGGTGCTCG CCGACAGCGA GACGCCGTTG
TCGGCGTATC GCAAACTCGC CGCCAACCGC CCCGGGACGT TCCTCCTCGA GTCCGCCGAG
AACGGCCGGT CCTGGTCGCG CTGGTCGTTC ATCGGCGCGG GTGCGCCGTC GGCCCTGACC
GTCCGCGACG GCGAGGCGGT GTGGCTGGGT GTCATCCCGC AGGACGCGCC GTCCGGCGGC
GACCCGTTGC AGGCGCTGAG GGCGACGCTG ACGTTGCTGG AGACCGCTCC GCTGCCCGGG
CTGCCCCCGC TGTCGTCGGG TCTGGTCGGC TTCTTCGCCT ACGACCTGGT GCGCCGGCTG
GAGCGGCTGC CCGAGCTGAC CGTCGACGAC CTCGCCCTGC CGGACATGAT GCTGCTTCTG
GCGACCGACG TCGCCGCCGT CGACCACCAC GAGGGCACGA TCACGCTGAT CGCCAACGCC
GTGAACTGGA ACGGCACCGA CGAGCGGGTG GACTGGGCCT ACGACGATGC CGTGGCCCGG
CTCGATGTCA TGACGGCGGC GTTGGCCGAA CCGTTGGCGT CCACCGTGGC GACGTTCAGC
CGGCCGGTGC CGGAACACCG GTCGCAGCGC ACCGTCGAGG AGTACACCGC GATCGTCGAC
AAGCTGGTCG GTGACATCGA GGCGGGCGAG GCGTTCCAGG TGGTGCCGTC GCAGCGGTTC
GAGATGGACA CCGACGCCGA TCCCCTCGAT GTGTACCGGA TGCTGCGGGT GACCAACCCG
AGCCCCTACA TGTATCTGCT CAACGTGCCG AACGCTGATG GGGGACTGGA CTTCTCGATC
GTCGGCTCCA GCCCGGAGGC GCTGGTCACG GTCAAGGACG GCCGCGCCAC CACGCACCCG
ATCGCCGGCA CCCGGTGGCG CGGCGACACC GAAGAGGAAG ACCTTCTGCT GGAGAAGGAG
CTGCTCTGCG ACGAGAAGGA GCGCGCCGAA CACCTGATGC TCGTCGACCT CGGCCGCAAC
GACCTGGGGC GGGTGTGCCG TCCGGGCACC GTGAAGGTCG AGGATTACAG CCACATCGAG
CGGTACAGCC ACGTCATGCA CCTGGTGTCG ACGGTCACCG GTCTGCTCGC CGACGGCAAG
ACCGCGCTGG ACGCGGTGAC GGCGTGCTTC CCGGCCGGGA CGCTGTCCGG CGCGCCCAAG
GTCCGGGCGA TGGAGTTGAT CGAGGAGGTC GAGAAGACCC GTCGCGGCCT CTACGGCGGT
GTCCTTGGCT ATCTCGACTT CGCCGGCAAT GCCGATTTCG CGATCGCGAT CCGGACCGCG
TTGATCCGCA ACGGCACCGC CTACGTGCAG GCCGGCGGTG GCGTCGTCGC GGATTCGAAC
GGCCCGTACG AGTACAACGA GGCCTCCAAC AAGGCCAGGG CGGTGCTGGC CGCGATCGCT
GCGGCAGAGA CGCTGAGCGA ACCGTGA
 
Protein sequence
MQMTATLSAT TTREDFRALA AEHRVVPVTR KVLADSETPL SAYRKLAANR PGTFLLESAE 
NGRSWSRWSF IGAGAPSALT VRDGEAVWLG VIPQDAPSGG DPLQALRATL TLLETAPLPG
LPPLSSGLVG FFAYDLVRRL ERLPELTVDD LALPDMMLLL ATDVAAVDHH EGTITLIANA
VNWNGTDERV DWAYDDAVAR LDVMTAALAE PLASTVATFS RPVPEHRSQR TVEEYTAIVD
KLVGDIEAGE AFQVVPSQRF EMDTDADPLD VYRMLRVTNP SPYMYLLNVP NADGGLDFSI
VGSSPEALVT VKDGRATTHP IAGTRWRGDT EEEDLLLEKE LLCDEKERAE HLMLVDLGRN
DLGRVCRPGT VKVEDYSHIE RYSHVMHLVS TVTGLLADGK TALDAVTACF PAGTLSGAPK
VRAMELIEEV EKTRRGLYGG VLGYLDFAGN ADFAIAIRTA LIRNGTAYVQ AGGGVVADSN
GPYEYNEASN KARAVLAAIA AAETLSEP