Gene Mvan_5049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5049 
Symbol 
ID4644786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5402950 
End bp5404284 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content76% 
IMG OID639808520 
ProductFHA domain-containing protein 
Protein accessionYP_955827 
Protein GI120405998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGAC GCCACCGTAA GCCCGCACCA TCTGCTGTAA ACGTCGCCAA GATCGCTTTC 
ACCGGCGCAG TCATCGGAGG CGGAGGCCTC GCCCTGGCCG GCCAGGCCGG TGCAGCCACC
GACGGCGAGT GGGACCAGGT CGCTCGGTGC GAGTCGGGCG GCAACTGGGC GATCAACACC
GGCAACGGCT ACCACGGCGG CCTGCAGTTC TCTCCGGGCA CCTGGTCGGG CCACGGCGGT
GGCGAGTACG CCCCGGCGGC CTACCTGGCC ACCAAGGAAG AGCAGATCGC GGTCGCCGAG
CGTGTCCTGG CATCTCAGGG CAAGGGCGCG TGGCCGTCTT GCGGCGGGCC GCTCTCCGCC
GCAACGCCGC GCAACGTCGT CGCCGAGCCG CCCGCTCCGG TCGATCCTGT GGGACTCAAC
GGCCTGCTGC CGCCGCCTCC GGTCGATCCG TTCGCCCCGC CGCCCGCGCC GGCCCCTGTC
GATGCTCTCG CGGCCCCCGC GCCCCTGCCG CCCGCGCCCG AGGCGCTGCC CCCGGCGCCC
GCACCGGCCG CACCGTTCGA CGCGATGTCC GCCCCGGCCC CGTTGCCGCC CGCACCCGAG
GCGCTGCCCC CGGCGCCCGC ACCGGCCGCA CCGTTCGACG CGATGTCCGC CCCGGCCCCG
TTGCCGCCCG CACCCGAGGC GCTGCCCCCG GCGCCGGTGG TCGACGCCGC CAACATCGCT
CCCCTGGACG CACCGCTGCC GCCGCCGGCC CCGGTCGACG CCCCGCCTGC GCAGAGCGCC
ATCATCGCCG CCGCGAACTG GGACACCGCG CCGGTGCCCG GTGAGCACCC GCAGTTGTGG
TCACTCGGCG TGGATGCTCC GCTGCAGCCC GCCCCCGTGC TGCCGCCTGC TCCGGCACCT
GCCCCGGCCC CGGCTGCTGT CGCCGCGCCC GCACCCGATC CGCTGGCCCC CCTCGGCGCC
GTCGAGGTGC CCGCCCCGGC CTACGACATC GCGAACCAGG CGATCAGCGG CGAGCTGCCC
GTTCCCTCCG AGGTGCCGCA CCTGGCCAGC CCGGACAACC TGCCCCCCGG CACGGCGATG
GCGCCGCAGG GCCCCCGGCA GAGCCCCAAC GTGACGTACC TCAAGGAGAT CTGGCACGCG
ATCCAGACTC AGGAGATCTC CGGTGCCGAT GCGCTGCTGG CGCTTACCCA GCGGCCGCTG
ACGACCCCCG ACACCCCGGG CGGCACCCCG CCGATCGCTC CGGGCGCTCC GGGCGCCCCG
GTGGCGCCGC TGGCAGGGCC GGCACCGGCT CCCGCTCCTG TCCCGGCACC CGCCCCGGTG
TTGCCACCCG CCTGA
 
Protein sequence
MSGRHRKPAP SAVNVAKIAF TGAVIGGGGL ALAGQAGAAT DGEWDQVARC ESGGNWAINT 
GNGYHGGLQF SPGTWSGHGG GEYAPAAYLA TKEEQIAVAE RVLASQGKGA WPSCGGPLSA
ATPRNVVAEP PAPVDPVGLN GLLPPPPVDP FAPPPAPAPV DALAAPAPLP PAPEALPPAP
APAAPFDAMS APAPLPPAPE ALPPAPAPAA PFDAMSAPAP LPPAPEALPP APVVDAANIA
PLDAPLPPPA PVDAPPAQSA IIAAANWDTA PVPGEHPQLW SLGVDAPLQP APVLPPAPAP
APAPAAVAAP APDPLAPLGA VEVPAPAYDI ANQAISGELP VPSEVPHLAS PDNLPPGTAM
APQGPRQSPN VTYLKEIWHA IQTQEISGAD ALLALTQRPL TTPDTPGGTP PIAPGAPGAP
VAPLAGPAPA PAPVPAPAPV LPPA