Gene Mvan_1644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1644 
Symbol 
ID4648686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1747082 
End bp1748308 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content70% 
IMG OID639805139 
Producthypothetical protein 
Protein accessionYP_952479 
Protein GI120402650 
COG category[S] Function unknown 
COG ID[COG5383] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.589728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.39412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTAGCG TCGAAACATG GCAGTTGCGG GCGCAATTCG CCGCCGGGCT GTCACAGATG 
TACGGCGCCG AGGTGCCCGC CTACCACACC CTGGTCGACG TCAGCTCCGC GGTGAACGCC
GCGTACGCCG CGGCCTCTCC GGCACTGCGA CTGGGTTCGC TGGAGCGTGT CACCGCCGAA
CGCCACGGCG CCATCCGGGT CGGCTCCGCC GCCGAGCTGG CGCAGGTGGC CGACCTGTTC
TCGGCGTTCG GGATGCATCC CGTCGGCTTC TACGACCTGC GGGAGGCCGC GTCCCCGGTG
CCCGTGGTGT CCACGGCGTT CCGGCCGGTC GATCCGGATG AATTGGCCCG CAACCCCTTT
CGGGTGTTCA CCTCGATGCT GGCCAGCGCC GACATACGGT TCTTCACCGC GGACCTGCGT
GGACGGGTCG AGCGGTTCGT CGCCGGGCGG CAGCTGTTCG ATCCCGGATT GATCGCCTCG
GCACACCTGA TCGCCGCCGC CGGCGGATGT CCGGCCGACG AGGCTGCCGA CTTCGTCGCC
GCCGCGGTGT CGGCGTTCGC GCTGTCCCGC GAGCCGATCG ACCGCGCCTG GTACGACGAA
CTCGCCGCGG TCTCGGCCGT CGCCGCCGAC ATCGCGGGCG TCACCTCCAC CCACATCAAC
CACCTGACCC CGCGGGTGCT CGACATCGAC GAGTTGCAGC GCGCGATGAC CGAGCGCGGC
ATCACGATGA TCGACCACAT CCAGGGGCCG CCCCGGGTAT CCGGACCTCA GGTGTTGTTG
CGGCAGACGT CCTTTCGGGC GCTGGCCGAA CCGCGGCGGT TCCGCACGCC CTCCGGCGAG
GTGGTCGACG GCACGCTGCG GGTGCGGTTC GGCGAGGTCG AGCAGCGCGG GATCGCGCTC
ACGCCGCTGG GCCGTTCGCA TTACGACGCG GCGATGGCCT GCGCCGACCC TTCGGCGGTG
TGGGGTGAGC ACTTCCCCGC CACCGATGCC GAGATGGCCG CGTCCGGGCT GGCCTACTAT
CACGGTGGCG ATCCGTCGAG ACCTGTTGTG TACGAAGACT TTCTCCCCGC CTCGGCGGCC
GGGATCTTCC GCTCCAACCT CGACACCGAC GCCGTGGCCG GCGCGGGCGG CGACACCACC
GACTACAGCC TCGACTGGAT GGCCGGGCAG ATCGGCCATC ACATCCACGA CCCCTACGAC
CTCTACGAGA AAGTCGCGTC CTCATGA
 
Protein sequence
MCSVETWQLR AQFAAGLSQM YGAEVPAYHT LVDVSSAVNA AYAAASPALR LGSLERVTAE 
RHGAIRVGSA AELAQVADLF SAFGMHPVGF YDLREAASPV PVVSTAFRPV DPDELARNPF
RVFTSMLASA DIRFFTADLR GRVERFVAGR QLFDPGLIAS AHLIAAAGGC PADEAADFVA
AAVSAFALSR EPIDRAWYDE LAAVSAVAAD IAGVTSTHIN HLTPRVLDID ELQRAMTERG
ITMIDHIQGP PRVSGPQVLL RQTSFRALAE PRRFRTPSGE VVDGTLRVRF GEVEQRGIAL
TPLGRSHYDA AMACADPSAV WGEHFPATDA EMAASGLAYY HGGDPSRPVV YEDFLPASAA
GIFRSNLDTD AVAGAGGDTT DYSLDWMAGQ IGHHIHDPYD LYEKVASS