Gene Mvan_0704 details

Gene Information

Locus tag: Mvan_0704
Symbol:
ID: 4643647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 746801
End bp: 747799
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 67%
IMG OID: 639804204
Product: extracellular solute-binding protein
Protein accession: YP_951548
Protein GI: 120401719
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID:
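
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace used for grouping is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record above.
print(cds_length(746801, 747799))  # 999, matching "Gene Length"
print(protein_length(999))         # 332, matching "Protein Length"
```

Passing the gene sequence listed under "Sequence" to `gc_percent` gives a value that can be compared with the reported GC content.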


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.721157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0331446
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCTT GCGCGAAGAA GAAACAGCCG CTGATGAAGG TGTCTGCATG GGGCGCGCTG 
CTGGCGGGGG TGCTGGTGTT GGGGGGGTGC GCGCAGACGT CGCCGGTGGT GCCGACACCG
AGTGTCACGC TGGCGCCGCC GACGCCTGCG GGGCTGGAGG AGATGCCGCC GGAGCCTGCG
CGTGCACCGA CCGCCGCGGA CGACGACTGT GACCCCCTGG CCAGCCTGCG CCCGTTCGAC
AACAAGGAAG ACGCCGACAA GGCGGTGGCC AACATCAAGG CCAGGGGCAG GCTCATCGTC
GGCCTCGACA TCGGCAGCAA CCTGTTCAGC TTCCGCGACC CGATCACCGG CGAGATCACC
GGCTTCGACG TCGACATCGC CGGTGAGATC GCGCGCGACA TCTTCGGCAC CCCGTCGCAG
GTGGAATACC GCATCCTGTC TTCGGCGGAT CGCGTCGAGG CGCTGCAGAA GAACCAGGTC
GACGTGGTCG TCAAGACGAT GACGATCACC TGTGAGCGCA AGAAACTGGT GAACTTCTCG
ACTGCGTACC TGTCCGCCAA CCAGCGCATC CTGGCACCGC GGGATTCGAA CATCCGGCAG
TCGTCCGACC TGTCGGGCAA GCGGGTCTGT GTCGCCAAGG GCACCACGTC GCTGGAACGC
ATCCAGCAGA TCACGCCGCC GCCGATCATC GTCGGCGTGG TCACCTGGGC GGACTGCCTG
GTCGCGTTGC AGCAGCGGCA GGTCGACGCT GTCAGCACCG ACGACTCGAT CCTGGCCGGG
CTGGTGTCCC AGGACCCCTA TCTGCACATC GTGGGACCGT CGATGAACGA GGAGCCTTAC
GGCATCGGTG TCAACCTGGA AAACACCGGG CTGGTGCGCT TCGTCAACGG GACGCTGCAG
CGCATCCGGC GCGACGGCAC CTGGAACACG CTGTACCGCA AGTGGTTGAC CGTACTCGGG
CCAGCGCCCG CGCCCCCCGC CGCGAGGTAC TCGGACTGA
 
Protein sequence
MSACAKKKQP LMKVSAWGAL LAGVLVLGGC AQTSPVVPTP SVTLAPPTPA GLEEMPPEPA 
RAPTAADDDC DPLASLRPFD NKEDADKAVA NIKARGRLIV GLDIGSNLFS FRDPITGEIT
GFDVDIAGEI ARDIFGTPSQ VEYRILSSAD RVEALQKNQV DVVVKTMTIT CERKKLVNFS
TAYLSANQRI LAPRDSNIRQ SSDLSGKRVC VAKGTTSLER IQQITPPPII VGVVTWADCL
VALQQRQVDA VSTDDSILAG LVSQDPYLHI VGPSMNEEPY GIGVNLENTG LVRFVNGTLQ
RIRRDGTWNT LYRKWLTVLG PAPAPPAARY SD