Gene Mvan_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2024 
Symbol 
ID4647225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2163115 
End bp2165418 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content66% 
IMG OID639805509 
Productcarbon starvation protein CstA 
Protein accessionYP_952847 
Protein GI120403018 
COG category[T] Signal transduction mechanisms 
COG ID[COG1966] Carbon starvation protein, predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACAC CCACCGCCGC ATCGGAACGC ATCGAGGAAA CCGTCGGAGA CATCACCTAC 
ATCCGTACGG ACAAGAACCT GCCGCCGGTG GCGATCATCG ACAGATCGCC GATCACCGTA
AAACACAGGA TCATCTTCGC GGTCGTCGCA CTGCTGGGCG CGGTCTCCTG GGCGATCATC
GCGTTCTTCC GGGGCGAGAC GGTCAACGCG GTGTGGTTCG TCTTCGCCGC GATCTGCACC
TACGTCATCG GTTTCCGGTT CTACGCCCGC CTCATCGAGA TGAAGATCGT CCGGCCGCGC
GACGAGATTG CCACGCCGGC AGAGGTTTTC GACAACGGCA CCGACTACAT GCCGACCGAT
CGGCGGGTGC TCTACGGGCA CCACTTCGCG GCGATCGCCG GTGCCGGGCC ACTGGTCGGC
CCGGTACTCG CCATGCAGAT GGGTTATCTG CCGGGCACCA TCTGGATCAT CATCGGCGCG
GTCGTCGCCG GATGCGTGCA GGACTACCTG GTGCTGTCGA TCTCCGTGCG TCGGCGCGGT
CGCTCGCTGG GTCAGATGGC GCGCGACGAA CTCGGCGCCG TCGGCGGAGT CGCGGCCATC
GTCGGCGTGC TGGTGATCAT GGTGATCCTG CTGGCGGTGC TGGCGCTGGT CGTGGTCAAC
GCGTTGAGCG AGAGCCCGTG GGGCGTCTTC TCGATCGCGA TGACGATACC TATCGCACTG
TTCATGGGTC TGTACCTGCG TTTCCTGCGC CCCGGGCGCG TCTCGGAGGT CTCGTTGATC
GGCGTGGTGC TCCTGCTGCT CGCGGTTGTC GCGGGCGGCT GGGTGGCCGA GACATCTTGG
GGCGCCGAGT GGTTCACGCT GTCCAAGGTG GCGCTGTCCT GGTGCATCAT CATCTACGGC
CTGGCTGCCT CGGTGCTGCC GGTGTGGCTG CTGCTCGCCC CTCGCGACTA CCTGTCGACG
TTCATGAAGG TCGGCACCAT CGCGCTGCTG GCGGTCGGGA TCCTGCTGGC CCGCCCGATC
ATGGAAGCGC CCGCGATCTC GTCGTTCGCC GCCAGCGGCA CCGGGCCGGT GTTCGCCGGT
TCGCTGTTCC CGTTCCTGTT CATCACCATC GCGTGCGGCG CGCTGTCGGG GTTCCATTCG
CTGATCTCGT CGGGCACCAC CCCGAAGCTG CTGGAGAAGG AAAGCCAGAT GCGGCTGATC
GGCTACGGCG GCATGCTGAC CGAGTCGTTC GTCGCGATCA TGGCGCTGAT CACCGCCGCG
ATCCTCAACC AGCATCTGTA TTTCGTGATG AACGCCCCCA CCGCGTCGAC AGGCACCACC
GCCCAGTCGG CAGCCGACTA CGTCAACGGT CTCGGACTGT CCGGTGCACC GATCAGCGCG
CAGGAGATCA CCGACGCCGC CGAGAGCGTG GGTGAGGAGT CCATCGTGTC CCGCACCGGC
GGCGCCCCGA CCCTGGCGTT CGGGATGTCG GAGGTGCTGC ACCAGGTGTT CGGCGGCGCG
AGCCTCAAGG CGTTCTGGTA CCACTTCGCG ATCATGTTCG AGGCGCTGTT CATCCTGACC
ACCGTCGACG CCGGGACCCG GGTCGCGCGG TTCATGCTCT CCGACGGGTT GGGCAACCTC
GGCGGCCCAC TCAAGCAGCT GCGCAATCCC AGTTGGCGGG TCGGCGCCTG GATCTGCAGC
ATCATCGTCG TCGCGGCCTG GGGCAGCATT CTGCTGATGG GCGTCACCGA CCCCCTCGGC
GGCATCAACA CCCTGTTCCC GCTGTTCGGC ATCGCCAACC AGTTGTTGGC TGCGATCGCG
CTGACGGTCG CGACCGTCGT GGTCATCAAA CGCGGTCTGC TGAAATGGGC GTGGATACCC
GGGGTTCCGC TGCTGTGGGA TCTGGTGATC ACGATGACGG CCTCGTGGCA GAAGATCTTC
TCCGGCGACC CCAAGGTCGG CTACTGGACG CAGCACTTCC AGTACCGCAA CGCCAGGGAC
GCCGGACAGA CGAGCTTCGG CGCCGCCAAG GACGCCGGAG CGCTCGACGC CGTCATCCGC
AACACCTTCA TCCAGGGCAC GCTGTCGATC GTCTTCGCGG TGCTGGTGCT CATCGTGTTC
ACCGCGGGAG TCGTCATGGC GGTCAAGGCG ATTCGCGGCA CCGGCCGGCC GCTGGCCGAA
GACAACCCGA TACCGTCGCG CATCTTCGCG CCGTCCGGCA TGGTGATGAC TTCAGCGGAG
AAGGAGGTGC AGAAGCAGTG GGACGCACTT TCGAAGGCGC ACGCCGGGCC GTCCACCAGA
TCGGCTGGTA CTGGAAGTCG CTGA
 
Protein sequence
MATPTAASER IEETVGDITY IRTDKNLPPV AIIDRSPITV KHRIIFAVVA LLGAVSWAII 
AFFRGETVNA VWFVFAAICT YVIGFRFYAR LIEMKIVRPR DEIATPAEVF DNGTDYMPTD
RRVLYGHHFA AIAGAGPLVG PVLAMQMGYL PGTIWIIIGA VVAGCVQDYL VLSISVRRRG
RSLGQMARDE LGAVGGVAAI VGVLVIMVIL LAVLALVVVN ALSESPWGVF SIAMTIPIAL
FMGLYLRFLR PGRVSEVSLI GVVLLLLAVV AGGWVAETSW GAEWFTLSKV ALSWCIIIYG
LAASVLPVWL LLAPRDYLST FMKVGTIALL AVGILLARPI MEAPAISSFA ASGTGPVFAG
SLFPFLFITI ACGALSGFHS LISSGTTPKL LEKESQMRLI GYGGMLTESF VAIMALITAA
ILNQHLYFVM NAPTASTGTT AQSAADYVNG LGLSGAPISA QEITDAAESV GEESIVSRTG
GAPTLAFGMS EVLHQVFGGA SLKAFWYHFA IMFEALFILT TVDAGTRVAR FMLSDGLGNL
GGPLKQLRNP SWRVGAWICS IIVVAAWGSI LLMGVTDPLG GINTLFPLFG IANQLLAAIA
LTVATVVVIK RGLLKWAWIP GVPLLWDLVI TMTASWQKIF SGDPKVGYWT QHFQYRNARD
AGQTSFGAAK DAGALDAVIR NTFIQGTLSI VFAVLVLIVF TAGVVMAVKA IRGTGRPLAE
DNPIPSRIFA PSGMVMTSAE KEVQKQWDAL SKAHAGPSTR SAGTGSR