Gene Mvan_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1021 
Symbol 
ID4644242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1070916 
End bp1071983 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content68% 
IMG OID639804522 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_951865 
Protein GI120402036 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0514816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.754693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAT TTGTCGGACA CGACGCGCAT CCCGCCGACG TGGACACGCA CAACGCCGAC 
CCGGTGCTCG ACGTCGACGG CCTGACCGTG GAGGTTCGCA CCATCGGCGG AACCCTGCGC
GCCGTCACCG GCGTCTCGTT CCAGGCCCGT CGCGGTGAAA CCCTGGCCCT GCTGGGTGAA
TCCGGCTGCG GCAAATCGAT GACCGCCACC GCACTGGTCG GTCTGCTGGA CCCGGTCGCC
GACATCGTCG GCGGCACGGC GCTGCTGGCC GGCCGGCAGG ACCGAGTGGA CCTGCTCAGC
ATGGACCGCA GGAAGCGACG CGAGATGGCG GGCACCGAGC TGGCGATCGT GTTCCAGGAC
GCGCTCACCG CGCTCAACCC GCTCTACACC GTGGGAACCC AACTGGCCGA ACCATTCCGG
ATCCACCAGG GTCTGAGCGC CAGGGAGGCC AAGCGCAAGG CTGTCGACCT GATGGCCCGG
GTCGGCATCC CGCAACCGGA AAGCCGGCTG AACTCCTACC CGCACCAGTT CTCCGGCGGG
ATGCGACAGC GCCTGCTGAT CGCGATGGCC GTGGCGCTGA ATCCGAGCGT GCTGATCGCC
GACGAGCCCA CCACCGCACT CGACGTCACC GTGCAGGCCC AGATCATGGC GCTGCTGCGC
GATCTGCGCA CGGAATACCG CATGGCCGTC GTGCTGATCA CCCACGATCT CGCGCTGGTT
GCCGAGGAGG CCGACCGGGT CGCGATCATG TACGCCGGAC ACATCGTCGA AACCGGCACT
GTTGCCGAGG TTTTCGCCCA CCCCAAGCAT CCCTACACCC AAGGCTTGCT GAGTTCGGTG
CCCGTCAACG CCCGCCGCGG TGATGCGCTC ACGTCGATCG GTGGCTCCCC GCCGGATCTG
CACTCGATCC CGCAAGGCTG CGTGTACCAG GCGCGGTGCC CACTCGCGGC CGAAGTGTGC
CGCACCACCA GGCCCGCGCT GGCCCCGGTG GGTGGCGACC GCAAGGCCGC GTGCCACTTC
CCGGAAGAGG TGTCCCCCCG CATGCGAGGA GAGAGAACAG ATGTCTGA
 
Protein sequence
MTAFVGHDAH PADVDTHNAD PVLDVDGLTV EVRTIGGTLR AVTGVSFQAR RGETLALLGE 
SGCGKSMTAT ALVGLLDPVA DIVGGTALLA GRQDRVDLLS MDRRKRREMA GTELAIVFQD
ALTALNPLYT VGTQLAEPFR IHQGLSAREA KRKAVDLMAR VGIPQPESRL NSYPHQFSGG
MRQRLLIAMA VALNPSVLIA DEPTTALDVT VQAQIMALLR DLRTEYRMAV VLITHDLALV
AEEADRVAIM YAGHIVETGT VAEVFAHPKH PYTQGLLSSV PVNARRGDAL TSIGGSPPDL
HSIPQGCVYQ ARCPLAAEVC RTTRPALAPV GGDRKAACHF PEEVSPRMRG ERTDV