Gene Vapar_0204 details


Gene Information

Locus tag: Vapar_0204
Symbol:
ID: 7971413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 216004
End bp: 217080
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 61%
IMG OID: 644790807
Product: pentapeptide repeat protein
Protein accession: YP_002942133
Protein GI: 239813223
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
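
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene length that includes the terminal stop codon. The function names gene_length and protein_length are hypothetical and not part of the source database.

```python
# Hypothetical helpers (not from the source database) relating the
# coordinate fields of a CDS record to its reported lengths.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


print(gene_length(216004, 217080))  # 1077
print(protein_length(1077))         # 358
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.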


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.550889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCGG CCAGTGTGCA GGCGATGATC CATGGCGGAG AAACGATCGC TCGGAAGCAA 
CTTCAAGCGC TCGACTTTCG GGGCATGGAC CTCACGGGTG CCATTTTCCT GGAGACCGAT
CTGAGTGATG CCGATCTGTC CGGTGCCAAG CTCGATGCGA GCATCTTCAA GCAGTGCAAG
CTTGCCGGGG CCCGGCTTGA TGCCGCGCAT GTGGACCGTT GTGCATTCGA GCATTGCGAT
GCAACCGGGA TCCGGATGGT CGGCGCCACG CTCGACGGTG CCGCCTTCTA CCAATGTGGA
CTGGCCGGCG CAAAGCTCGA TGACGCAAAT GCCGGCGTTG CCTCTTTCGC CAACTGCACG
CTCGACCGCG CCTCGTTCGT CAATGTGCTG TCGGAAGCCC TGGTGTTCTC CGAGACCTCG
CTCGATGCCA CCGACTTTTC CACGACGCTC CTGCACAAGA CAGTGTTCTA CCGGGCAGAT
CTTCGATCGG CAATCTTCCG TGGCAGCCGG TTCGACAAGG CTGTCTTCTC CGAGGCCAAG
CTCTCGGGAC AGAGATTCAA TGCGCAGAAG TTCCTGCTTT GCCAGTTCAT CGACGCAGAA
CTCGACGGAT GCGATTTTGA CGAGTCCATG CTTGTGCAAT GCAACTTCAA GGGCGCCAAG
CTCGAAGGTG CGAGCCTGAA CCGGGTCCAT GCGCCCCAAT GCCTCTTTCC GTCGGCCAAG
CTCGATGGCG TCGGTTGCCG CGGCGCCCAT TTCAATCAAA GCATCTGGGT CGAGGTGCAG
GCGCGCGGAG CCGATTTCTC CGACGCGAAA CTGGAGCAGT GCATCTTCCA GCGTGCCAGC
TGCAGCCATG CCGTCTTCGC AAGGGCGGAC CTTACATACG CCGATTTTTC TCACGCCGAT
CTGCGTGGTG CCGATCTGCG GGGCGCTCGC TTCCTGCGCA CCAAGGTTCA CCGCGCGCAA
CAGGGGGGAA CGAGATATGG CGACCGTGGT GGCTTGCTGT TGAACGACCC CGACCTGTTT
GCCGCCGAGG AGTTCTCGGC ACGCCGCCTC GCGCGTCCGG GCGCATATCC CTCTTGA
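
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the percentage of G and C bases over the full sequence length; how the source database rounds the value is not stated. The function name gc_percent is hypothetical.

```python
# Hypothetical helper (not from the source database): percentage of G and C
# bases in a nucleotide sequence. Whitespace is ignored, so the sequence can
# be pasted in the 10-base blocks used in this record.

def gc_percent(sequence: str) -> float:
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Small worked example: 4 of 8 bases are G or C.
print(gc_percent("GGCC AATT"))  # 50.0
```

Applying the same function to the full gene sequence gives a value that can be compared with the GC content reported in this record.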
 
Protein sequence
MTPASVQAMI HGGETIARKQ LQALDFRGMD LTGAIFLETD LSDADLSGAK LDASIFKQCK 
LAGARLDAAH VDRCAFEHCD ATGIRMVGAT LDGAAFYQCG LAGAKLDDAN AGVASFANCT
LDRASFVNVL SEALVFSETS LDATDFSTTL LHKTVFYRAD LRSAIFRGSR FDKAVFSEAK
LSGQRFNAQK FLLCQFIDAE LDGCDFDESM LVQCNFKGAK LEGASLNRVH APQCLFPSAK
LDGVGCRGAH FNQSIWVEVQ ARGADFSDAK LEQCIFQRAS CSHAVFARAD LTYADFSHAD
LRGADLRGAR FLRTKVHRAQ QGGTRYGDRG GLLLNDPDLF AAEEFSARRL ARPGAYPS
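
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch of that translation, assuming the sequence is read in frame from the first base and translation stops at the first stop codon. Table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not model. The names CODON_TABLE and translate are hypothetical.

```python
# Hypothetical sketch (not from the source database) of in-frame translation.
# Codon-to-amino-acid assignments follow the standard layout with bases
# ordered T, C, A, G; table 11 uses the same assignments.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First four codons of the gene sequence in this record.
print(translate("ATGACCCCGG CC"))  # MTPA
```

The printed residues match the first four residues of the protein sequence in this record.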