Gene Mvan_3778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3778 
Symbol 
ID4645141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4020476 
End bp4021897 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content67% 
IMG OID639807243 
Producthypothetical protein 
Protein accessionYP_954566 
Protein GI120404737 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.499553 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCAT ACGCGGAGCC TGCGGAACTC ATCGACGCGA TGAGTTCGGC GGCGCGTGCG 
GAGTCGGCGG CGATCGCGCG GCGGCTGGAA GCGGTCGCGG CGTTGTTTCG TTCCCGTAAG
TGCGATTACG CCGAGGCAGG GTTCTTGCAC ACTGATGTGT ATGAGGCGGT GGCGGCTGAG
GTGTCGGCCG CGCAGAACAT CAGCCGGTCG CGGGCCGGTT ACCAGGTGGA GATGGCAGTG
TCGCTGTACA CCCGGTTACC GAAAGTGGCC GAAGCGTTCG CGCGGGGCGA TATCGATTTG
CGGATGGTGC AGATAGTTCT GGCCCGCACC AAGAACGTTG AGGATGACGT GATCGGCGGC
CTGGACAAGG CCATCGCGCC CAAATTGTCG CGGTGGATGC GGTTGTCCAA GAACGATCTT
CGGGATCGGG TGGATCTGTG GGTGGCGGAT TTCGATCCGG CCGCGGTGCG GGTGCCGCCG
GAAGCGAAGG ACAACCGCTA CTTCGATGTG ACACCCGATG TGCCGGGGAT GGCCTTCGCC
GGAGGACTGC TCAACGCCCG TGATGCCGCG GCGTTGGATC AGCGTCTGGA GGCGATCGCG
GCGACGGTGT GCAGCAACGA TCCACGCTCG CACAATAATC TGCGGGCCGA CGCGGCGGGG
GCGCTCGGGC GGGGGGAGTC GACCCTGACC TGTGAGTGCG GCGCCGAGGA TTGCCCGGCC
GCGGTGCTAC GGGAGTCCGC GGCGCAGGTG GTGATTCACA TCCTGGCCGA GCAGGCCACG
GTGGACGGAG ACGGTGACAA GGCGGGATAC CTGCCGGGGT TCGGGGTGCT GCCGGCCGAG
GAGGTCCGTG CCGCGGCCAA GACGGCGAAG CTCAAGCCGG TGCGATTGCC CGGCGCCGAA
CCGGAGAAGG GCTACCGCCC GTCGGCCGGA TTGAAGGATT TTCTGCAGTG GCGTGATCTG
ACCTGCCGCT TCCCGGGCTG CGACGCCCCG GTGGAGCGCT GCGATGTCGA CCATACGACG
CGGTGGCCAT TCGGGGTCAC GCATGCCTCG GGGCTCAAGC ATTACTGCCG TACCCATCAT
GTGATCAAGA CGTTCCTCAC GGGGGTGTAC GGCTGGCGCG ACGAGCAGCG TCGCGACGGC
ACGGTCGTGC TGACCGCGCC GACCGGGCAC GTGTACACCA CCGAACCGCT TGGCGGACTG
CTATTCCCGA CACTGGCGAC ACCGACCGCG CCACTGCCCG ACGTCGAAGT GCCCGAAGAT
GACCCGGACA AGGCGGCGAT GATGCCGCGG CGGCGTACCC GTGAGCAGGA ACGGCGGGCT
CGGATCGCGC GCGAACGCCG ACAACGCATC GAGATCAACG CCGAACGCGA ACGGCAACAC
CAGGCCTGGC TCGCCGCAAC GTATGAACCA CCGCCGTTCT GA
 
Protein sequence
MFAYAEPAEL IDAMSSAARA ESAAIARRLE AVAALFRSRK CDYAEAGFLH TDVYEAVAAE 
VSAAQNISRS RAGYQVEMAV SLYTRLPKVA EAFARGDIDL RMVQIVLART KNVEDDVIGG
LDKAIAPKLS RWMRLSKNDL RDRVDLWVAD FDPAAVRVPP EAKDNRYFDV TPDVPGMAFA
GGLLNARDAA ALDQRLEAIA ATVCSNDPRS HNNLRADAAG ALGRGESTLT CECGAEDCPA
AVLRESAAQV VIHILAEQAT VDGDGDKAGY LPGFGVLPAE EVRAAAKTAK LKPVRLPGAE
PEKGYRPSAG LKDFLQWRDL TCRFPGCDAP VERCDVDHTT RWPFGVTHAS GLKHYCRTHH
VIKTFLTGVY GWRDEQRRDG TVVLTAPTGH VYTTEPLGGL LFPTLATPTA PLPDVEVPED
DPDKAAMMPR RRTREQERRA RIARERRQRI EINAERERQH QAWLAATYEP PPF