Gene Mvan_3830 details

Gene Information

Locus tag: Mvan_3830
Symbol:
ID: 4649249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4081203
End bp: 4082501
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID: 639807296
Product: CBS domain-containing protein
Protein accession: YP_954617
Protein GI: 120404788
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
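
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4081203
end_bp = 4082501

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1299 bp
print(protein_length, "aa")  # 432 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.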


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTCCGG TCGGTCAACT GATCAGCGCC ATCGTGCTGA TCTTTTTCGG CGGGGTGTTC 
GCCGCCATCG ACGCGGCGCT GAGCACGGTG TCGATGGCCC GCGTCGAGGA ACTGGTCCGC
GAGGAACGGC CGGGAGCGGT GCGCCTGGCC CGGGTCATGG TGGAGCGGCC CCGCTACATC
AACCTCATTG TGCTGCTTCG TATCACATGC GAGGTGAGCG CGACGGTGCT GCTGGCCGCG
TTCCTCGACG GAACCCTGGG CGTGACCTGG GGGCTGGTCG CTGCCGCGGC GGTCATGGCG
GTGGTCAGCT TCGTCGCCAT CGGTGTCGGG CCGCGCACGC TGGGCAGGCA GAACGCCTAC
ACCATCGCGC TGGTGACAGC GCTTCCACTG CAAGCCATTT CGGTGCTCCT GCTGCCGGTC
AGCCGGCTGC TGGTGTTGAT CGGTAATGCG CTGACACCCG GTCGCGGATT CCGCAACGGG
CCGTTCGCGT CTGAGATCGA GCTGCGGGAG GTCGTTGATC TCGCCCAGCA GCGCGGCGTG
GTGGCCGATG ACGAACGACG GATGATCCAG TCGGTGTTCG AACTCGGGGA CACACCCGCC
CGCGAGGTCA TGGTCCCGCG CACCGAGATG GTGTGGATCG AGTACGACAA GACCGCCGGG
CAGGCCACCT CCCTGGCGGT GCGCAGCGGG CACTCCCGGA TCCCGGTGGT CGGGGAGAAC
GTCGACGACA TCGTCGGGGT GGTGTACCTC AAGGACCTCG TCCAGCGCAC CTACTACTCG
AGCAACCAGG GCCGTGACAC CTCGGTGTCC GACGTGATGC GCAAGCCCAC GTTCGTGCCC
GACTCCAAGC CGCTGGACGC GCTGCTGCGT GACATGCAGC GCGACCGGGT TCACATGGTG
CTGCTGGTCG ACGAATACGG CGCGATCGCC GGGCTCGTCA CGATCGAAGA CGTGCTGGAG
GAGATCGTCG GCGAGATCGC CGACGAGTAC GACACCGACG AGGTGGCCCC CGTCGAGGAC
CTGGGCGATC AGCAGTACCG GGTGTCGGCA CGGCTGTCGA TCGAAGATCT CGGCGAGCTC
TACGGCATTG ATTTCGAGGA GGACCTCGAC GTCGACACCG TCGGTGGGCT GGTCGCCCTG
GAGCTGGGGC GCGTCCCGCT GCCCGGGGCC GAGGTGACCT GGGACGGACT GCGGTTGCGG
GCCGAGGGGG GACCGGACCC GCGCGGCCGC GTGCGGATCG GCACCGTGCT GGTCAGCCCC
GTCGAGCCGG CGCCCGACAC TGCGGAAGGA CACGAATGA
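
A GC content figure such as the one in the Gene Information section is the fraction of G and C bases in the sequence. The sketch below shows one way to compute it; the function name `gc_percent` is hypothetical, and the example input is only the first 60 bases of the gene sequence, so its result need not match the value reported for the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases), for illustration only.
first_line = "GTGAGTCCGG TCGGTCAACT GATCAGCGCC ATCGTGCTGA TCTTTTTCGG CGGGGTGTTC"
print(f"{gc_percent(first_line):.1f}% GC over the first 60 bases")
```

Passing the full 1299-base gene sequence to the same function gives the gene-wide value.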
 
Protein sequence
MSPVGQLISA IVLIFFGGVF AAIDAALSTV SMARVEELVR EERPGAVRLA RVMVERPRYI 
NLIVLLRITC EVSATVLLAA FLDGTLGVTW GLVAAAAVMA VVSFVAIGVG PRTLGRQNAY
TIALVTALPL QAISVLLLPV SRLLVLIGNA LTPGRGFRNG PFASEIELRE VVDLAQQRGV
VADDERRMIQ SVFELGDTPA REVMVPRTEM VWIEYDKTAG QATSLAVRSG HSRIPVVGEN
VDDIVGVVYL KDLVQRTYYS SNQGRDTSVS DVMRKPTFVP DSKPLDALLR DMQRDRVHMV
LLVDEYGAIA GLVTIEDVLE EIVGEIADEY DTDEVAPVED LGDQQYRVSA RLSIEDLGEL
YGIDFEEDLD VDTVGGLVAL ELGRVPLPGA EVTWDGLRLR AEGGPDPRGR VRIGTVLVSP
VEPAPDTAEG HE
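
The protein sequence follows from the gene sequence by translation with table 11, as listed in the Gene Information section. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the NCBI definition of table 11, in which the amino-acid assignments match the standard code and an initiating GTG is read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which table 11 shares); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: start codons permitted by NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        codon = bases[pos:pos + 3]
        residue = CODON_TABLE[codon]
        # An alternative start codon in first position encodes methionine.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_line = "GTGAGTCCGG TCGGTCAACT GATCAGCGCC ATCGTGCTGA TCTTTTTCGG CGGGGTGTTC"
print(translate(first_line))  # MSPVGQLISAIVLIFFGGVF
```

The output matches the first 20 residues of the protein sequence, including the leading M encoded by the GTG start codon.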