Gene Vapar_5917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_5917 
Symbol 
ID7974956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012792 
Strand
Start bp621834 
End bp623171 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content65% 
IMG OID644796485 
ProductHipA N-terminal domain protein 
Protein accessionYP_002947759 
Protein GI239820574 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.902916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCCGTC GGTCACACAG CCAGTCCCTC GGCCTCTGGT CCAACGGTGA ACGCGTCGGC 
CGCTGGACAA TCCCTGCCCG CGGCGACATG GAGCTTCACT ACGACGACGC CTGGGTTCGA
TCGGACGTCG GTCGCCCGCT TTCCCTGTCC CTGCCCTTCA ACCCGCACAA CGAGCCCATC
AAGGGCGCCG CCGTCGAACA CTACTTCGAC AACCTGTTGC CCGAGAGCAA TGCCATTCGC
AAGCGCGTGG CGGCGCGCTT CAAGACCGGC TCGGTAGACG CCTTCCCTCT TCTGCGCGCT
ATCGGGCGCG ACTGCGTGGG CGCCGTTCAG CTTCTCGACG AGGCCCAGAC ACCCACGGCC
ACCGATCAGG TGGAAGCGGT GCAGGTCGAT GACGAGTCCA TCGAGCGGCA CCTGCTGAGC
GTCGTCAGTC CCGACAAGTT CGGCGCTTCT GAGGACCCGG ACGACGACTT CCGCATTTCC
CTGGCCGGCG CGCAGGAAAA GGATGCCTAT CTGTGGTGGA ACGGCGCCTG GCACAAGCCG
CGGGGCGCCA CCCCCACCAC GCACATTTTC AAGCTCCCGT TGGGCCTGAT CGGCGGCGTC
CGGGCCGACT TCTCCACCTC GGTGGACAAC GAGTGGCTGT GCTTGAAGCT GCTACACGCC
TACGGGCTCT CCACGGCAGA CGCCACCATC ACCTCGTTCG GGAAACAGCG CGTCCTCGTC
GTCGAACGCT TTGACAGGCG CATTTCGAAC GGCCGCCTCC TGCGGCTGCC CCAGGAAGAC
TTCTGCCAGG CGACGGGGAC GTCGCCGCTC ATGAAGTACG AGAGCGAAGG CGGACCCGGC
CTGCGCAAGC TCTTTGCACT GCTGCAGCAG TCCGCGACCG CGGCGGATGA CATGCGCACC
TTGATGGCCT CGCAGGTCCT GTTCTGGCTG CTGCGCGCGC CGGATGGACA TGCGAAGAAC
TTCAGCATTC ATCTGCTGGC CGGCGGCGGC TTCCGGCTGA CGAAGATGTA TGACGTGATG
TCGGCCTATC CCATCCTCGG CAAGGGCCCC AACCAGTGGG CGCCACGCGA GGTCAAGATG
GCCATGGCGC TTCTCGGGAA GAGCAAGCAC TACACCATGG CCGCCATCCA GCGCCGGCAC
TTCAACAGCA CCGCCCGACA GGTAGGCTAT GCGCACGACG CCGAAGCCAT CATCCAGCAG
CTGATTGAAC GCACGCCCCG CGCAATCAGC GAAGTGCAGG CGCAGTTGCC GAAGGATTTC
TCGCCATGGG TCGCCGAGCG TGTGCTGGGC GGGCTGCAGG CCGCGGTGGA CACGCTTGAA
GGGATGCCAT CCAACTGA
 
Protein sequence
MGRRSHSQSL GLWSNGERVG RWTIPARGDM ELHYDDAWVR SDVGRPLSLS LPFNPHNEPI 
KGAAVEHYFD NLLPESNAIR KRVAARFKTG SVDAFPLLRA IGRDCVGAVQ LLDEAQTPTA
TDQVEAVQVD DESIERHLLS VVSPDKFGAS EDPDDDFRIS LAGAQEKDAY LWWNGAWHKP
RGATPTTHIF KLPLGLIGGV RADFSTSVDN EWLCLKLLHA YGLSTADATI TSFGKQRVLV
VERFDRRISN GRLLRLPQED FCQATGTSPL MKYESEGGPG LRKLFALLQQ SATAADDMRT
LMASQVLFWL LRAPDGHAKN FSIHLLAGGG FRLTKMYDVM SAYPILGKGP NQWAPREVKM
AMALLGKSKH YTMAAIQRRH FNSTARQVGY AHDAEAIIQQ LIERTPRAIS EVQAQLPKDF
SPWVAERVLG GLQAAVDTLE GMPSN