Gene Vapar_5936 details


Gene Information

Locus tag: Vapar_5936
Symbol:
ID: 7974975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012792
Strand:
Start bp: 637656
End bp: 639026
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 63%
IMG OID: 644796500
Product: HipA N-terminal domain protein
Protein accession: YP_002947774
Protein GI: 239820589
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
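
The length fields in this record are consistent with the start and end coordinates if the positions are 1-based and inclusive. The record does not state its coordinate convention, so that is an assumption. A minimal Python sketch of the check follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(637656, 639026)
print(length, protein_length(length))  # 1371 456
```

Both values match the Gene Length and Protein Length fields above.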


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.647866
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGGT CTGAGGACAA CGTGTATCGC TTGCGCGTGA CGCACGATGG TGGCACCCCG 
GTCGGCGAAC TCGCCTACTC AAGAGCGGAA GACCGGTGGT CGTTCCGTTA TGACCACGCA
TGGGCGCGCC AGGGTGCCTT TCAACTCTCG CCGGCGTTTC CCTTGGAGCC GCCACCGGAC
GGCTATGACT CTCATGCGAT CCGACGCTTC ATCGTGAACC TCTTTCCGGA GGGGGCGCCT
CTTCGCGCCG CGCTCGAGCA ACTCCACGTC GCACCGAGCA ACGCATTTGC ACTGCTGCGG
GAAATGGGAG GGGAGACGAC CGGGGCCCTG GAGTTCCAGC CCTTCGATCA ACCACCGGCT
GCGGCCGCTC GTCGAGAACA ACGCTTTCTG TCGCGAGAGG AGCTCAGCGG CCGTATCGAT
GCTGCAAAGG AAGGCGGCCT CACGGTGTGG GACGGCCGGG TGCGAATGTC GATTGCGGGC
TATCAGGACA AGCTGGCCGT ATGGGCTGCG CACGACCTCG TCCATGACAC AGAGGCCAGC
ATGTGGCTGC CGGAGCCGCC ACTGGCCTCG ACTTTTATTC TCAAGCCGCA GCCGGCCGGC
CCACGTACAC CTCACCTCGT GGCCAACGAG CACTACTGCA TGACGCTCGC AGGGGCGTAT
GGCGCTCAGG TGGCCCGTGT TGCCATCATG CGGCTGAGGG TTCCGGTTCT GGTCGTCGCC
CGGTTCGACC GGCAATGGCG CGCCGAAGAA AACCACGATT GGGTGACAAA GCTGCACGTC
ATCGACGCCT GCCAGGCTGC CGATCTTTCG GTGGATTCCA AATATGAGCG CCATCTGGGC
AATTCCCCCG CCCTCGCACC ATATCGCGAT GGGATGAGTC TGCCGCGACT TTTTGGTCTT
GCCGCTCTCA TGCGCCGCCC GGCGGTGGCG CGGTTGGAGA TGTTGCGTTG GGCACTGTTC
CAGCTGGCGG TCGGCAACTC CGATGCGCAT GGAAAGAACT TTTCATTCTT CGTCGACCGA
ACCATGCTTG AGCCCGCGCC GTGGTATGAC GTCGTGAGCG TGGCTCAATA TCCGGAACTC
GACCAAAGCT TCGCAATGTC CTTCGGCGAT GCCTTCGGAT GGGAAGAACT CAACGCGATG
GAGCTTGCGC ATTTCGCCCA CCTATGCGGC ATCGATCAAA AGCTGTTGCA CCGGGAGACC
GAGCGGCTGT CCCGTGCGAT GAAGAGAGCA CCCGAACTTC TTTCCGCCCC GGTGTACACC
GAAGAGGAGC GTGACTTTCT GCACCCGATA TGCGAACTGG TGCAGCGGCG CAGCCAAACG
TTGGTGGAGC TGGCGGCCGG CGCCAGTGCC TTCACCGCCG AGCACTTCTA G
 
Protein sequence
MSGSEDNVYR LRVTHDGGTP VGELAYSRAE DRWSFRYDHA WARQGAFQLS PAFPLEPPPD 
GYDSHAIRRF IVNLFPEGAP LRAALEQLHV APSNAFALLR EMGGETTGAL EFQPFDQPPA
AAARREQRFL SREELSGRID AAKEGGLTVW DGRVRMSIAG YQDKLAVWAA HDLVHDTEAS
MWLPEPPLAS TFILKPQPAG PRTPHLVANE HYCMTLAGAY GAQVARVAIM RLRVPVLVVA
RFDRQWRAEE NHDWVTKLHV IDACQAADLS VDSKYERHLG NSPALAPYRD GMSLPRLFGL
AALMRRPAVA RLEMLRWALF QLAVGNSDAH GKNFSFFVDR TMLEPAPWYD VVSVAQYPEL
DQSFAMSFGD AFGWEELNAM ELAHFAHLCG IDQKLLHRET ERLSRAMKRA PELLSAPVYT
EEERDFLHPI CELVQRRSQT LVELAAGASA FTAEHF
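
The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The Python sketch below shows one way to strip the formatting, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the database's own method: the function names are hypothetical, the codon table uses the standard amino-acid assignments (which translation table 11 shares for internal codons), and the alternative start codons of table 11 are not handled, since this gene starts with ATG.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence listed in this record.
demo = clean_sequence("ATGAGCGGGT CTGAGGAC")
print(translate(demo))  # MSGSED
```

The translated prefix agrees with the first six residues of the protein sequence. Applying `gc_fraction` and `translate` to the full cleaned gene sequence would be the natural way to cross-check the GC content and Protein Length fields.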