Gene Vapar_6086 details

Gene Information

Locus tag: Vapar_6086
Symbol:
ID: 7975532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012792
Strand:
Start bp: 804261
End bp: 805601
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 68%
IMG OID: 644796642
Product: HipA N-terminal domain protein
Protein accession: YP_002947916
Protein GI: 239820731
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
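
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(804261, 805601)
    print(length, protein_length_aa(length))  # prints "1341 446"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.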


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGCA CACGGGGCCT GCAGGCCCTT TCCATCTGGG CCAACGGTGA ACGGGTCGGC 
AGCTGGCGCA TTCCCGCGCA CGGGGCGGAC GAACTGCGCT ATGACGATGC CTGGGCCGAT
TCGCCCGCCG GGCGCCCGCT CTCGCTTTCG TTGCCGCTGG TGCGCGGCTT CACCCACAAG
GGCGCGGTCG TCAGCAACTA CTTCGACAAC CTGCTGCCGG ACAGCCTGCC CATACGCCAG
CGCATCGCGA GCCGCTTCGG CACGCAGACG ACGCAGGCCT TCGACCTGCT GCAGGCGATC
GGCAGGGATT GCGTGGGCGC CATCCAGCTG CTGGGAGAGA ACGCCAGCCC GGCCGATGTG
GAACGCATCG AGGGCGAACC GATGAGCGAA GCCGGCGTCG AGCGGCTGCT GCTGCAGACG
GTCGATCCGG GCAGGTTCGC CGCGCAGGCC GTGCCTGGCG ACGAACTGCG CATCTCGCTG
GCCGGCGCCC AGGAGAAAAC CGCGCTTCTG TGGCACGAGG GCCAGTGGCT GCGCCCGCAG
GGCTCGACCC CCACCACGCA TATCCTGAAG CTGCCCTTGG GCCTGGTCGG CCATCGCAAG
GCCGACTTCA GCACCTCGGT GGAGAACGAG TGGCTCTGCC TGAACATCCT CCAGGCGTAC
GGCCTTCCCG TGCCCCGCAC CGCGATGCTC CGGTTCGGTT CGCAGAAGGT GCTGGCCGTC
GAGCGCTTCG ACCGCCGGCT GCACTCCTCC GGAAACTGGT GGCTGCGCCT GCCGCAGGAA
GACTTCTGCC AGGCCCTCGG CAAGCCGTCG CACCTGAAAT ACGAAGCGGA CGGCGGACCG
GGAATGACCG ACCTCGCCGA CGTGCTGCGC AACTCAGTCA ACGCCCAGGA AGACCTGGCA
ACGCTCCTCA CCGCGCAGCT GCTCTTCTGG ATGCTGGGCG CGCCCGACGG GCATGCCAAG
AACTTCAGCA TCGCCTGGCT CCCGATGGGC CGCTACAGGC TGACGCCCCT CTACGACGTG
ATGTCCATCT GGCCCCTGGA AGGCAACGGC CCGAACCAGT TTTCCAGGCA CGAGGCCAAG
CTCGCGATGG CCTTGTCCGG CAAGAGCAGG CACTACCACT TCAAGACCAT CCAGCGGCGC
CACTTCAACG CCATGGCACA GAAGTGCCAC TACGACCCGG ATGCCGAGAA CATCATCCAG
CGCGTGCTGG CGGCAACGCC CGGCGTGATC GATCGGATCG CCGCGCGCCT GCCCGCGCAG
TTTCCGGTAG CGGTGTCGGG CCGGATCCTC GAAGGGCTCG CCCGCTCCGC GAGGGCGCTG
AAGGGAATGC CGCCCGTCTA G
 
Protein sequence
MARTRGLQAL SIWANGERVG SWRIPAHGAD ELRYDDAWAD SPAGRPLSLS LPLVRGFTHK 
GAVVSNYFDN LLPDSLPIRQ RIASRFGTQT TQAFDLLQAI GRDCVGAIQL LGENASPADV
ERIEGEPMSE AGVERLLLQT VDPGRFAAQA VPGDELRISL AGAQEKTALL WHEGQWLRPQ
GSTPTTHILK LPLGLVGHRK ADFSTSVENE WLCLNILQAY GLPVPRTAML RFGSQKVLAV
ERFDRRLHSS GNWWLRLPQE DFCQALGKPS HLKYEADGGP GMTDLADVLR NSVNAQEDLA
TLLTAQLLFW MLGAPDGHAK NFSIAWLPMG RYRLTPLYDV MSIWPLEGNG PNQFSRHEAK
LAMALSGKSR HYHFKTIQRR HFNAMAQKCH YDPDAENIIQ RVLAATPGVI DRIAARLPAQ
FPVAVSGRIL EGLARSARAL KGMPPV
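
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such text might be processed, the code below strips the whitespace, computes the GC fraction, and translates the DNA up to the first stop codon. The helper names are hypothetical. The codon assignments are the standard ones, which translation table 11 shares for amino acids; the sketch does not handle alternative start codons, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid assignments
# (identical for translation tables 1 and 11; "*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    first_blocks = clean_sequence("ATGGCGCGCA CACGGGGCCT GCAGGCCCTT")
    print(translate(first_blocks))  # prints "MARTRGLQAL"
```

Applied to the full gene sequence, `gc_fraction` and `translate` could be compared against the GC content field and the protein sequence listed in this record.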