Gene Vapar_3862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3862 
Symbol 
ID7969719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4090342 
End bp4091649 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content62% 
IMG OID644794448 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_002945742 
Protein GI239816832 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGACA GCAAGCTGCC ACGCTTCACC AGGCTCGACA AGTTCGATCC GCACCGCAAG 
GACTTCGTGT GCAAGCACGA GGCCGACGCC AATCCGCCCA TCACGGCGGA GTCCGAAGCC
CTGTTCCAGC AGGCGCTGGC GCTGGTGAGC TACGAGATGT GGTCGGAGAA CCGCAACTAC
GCGAAGGCGG CACAGCTGTA CGAGCAGGCA ATGAAGCTGG GCCATTGGAA AGCACAGTTC
AACCTCGCGG GGCTGTATCT GCAGGGTCTC GGTGTTGAGC AGAACCCCGA GAAAGCCATC
GAGTTAACCG AAGACTTGAT GCGCAAGGGC GTGCCGGCCG CCTGGGACAA CATGGGCACG
ATGTACATGG GCGGCATCGG ATCGCTCAAG CAGGATGCAA CGGTGGCCTA TGCGTTCTGG
CAGAAGGCAG CGGACATGGG AAGCATGACG TCGCAAACGT ATCTGGGGGC GAAGCTGAAC
GCGGATCACG ATGAGCCACC TGCGTTTTGG GGCAATCGCA GTACCGGGCT CAAGATGCTG
GAATGCGCGT TTGCTCAGGG CTCGGGACAA GCAGCTCTTA CTTTGGGCGC TACTCTCAAT
GTCGTTGAAA AGGATCATGT CCGCGCGCTC AGAGTCCTGC ATGAAGGTGT GAAGCTCGGA
AACGAGAAGA GTGCTAACTA TCTGGTTGGC GCATTTGGCA GAGGCGCTCC GCTAGTTGAT
GGTGTCGTCG ACAAGTCGCG TGCAGATCGC TACAGCGCTC TCGGCGATGC GCTCTACCAC
AACCCCGATC TGCGCTTTCC GAATCTTGAC AAAGTATTGC CCTTGCCGCC CGCGCCATTG
CCCCAGTGGG ACATGAGCGC CCCCAAGACG CTCATCGATG CCGCCAAGGC CGTGGTGCCT
GGTGCCTCTT CGCCACCGCA GCAAGCACCG GCGCCAACAT CTCAACGCAC CAGCCAGCTC
GAAAGCGCCG AACGAGGCAT GCCGGCCACT CACACGCGCG TGGCTCAAGG CATCGCGCGC
GAGGCCGACT TGCCGACACC GCCGGTTCGA TGCAACGGCG CCGGGCGCTG CCTCGTCACG
GGCATCTGGC AGGCGCGCGT GCCCGACGAC CACGCGCTTG CCGCATCGTT CAACCAGTGG
CATCGCCAGG CCTATGTGAT GGAAGGTCAG CCCTTCCCCG ATCCGCGCGA ACAGTACCTG
GACATCGATC CGGCGCAGGT CATCTGGACT TGGTGGAACC AGGCCAACCA TCTGGGCTTC
GCCAGGATTC CGCAGGTCAG CGTGGGCAAT CCGCCCCTCG CGGGGTGA
 
Protein sequence
MSDSKLPRFT RLDKFDPHRK DFVCKHEADA NPPITAESEA LFQQALALVS YEMWSENRNY 
AKAAQLYEQA MKLGHWKAQF NLAGLYLQGL GVEQNPEKAI ELTEDLMRKG VPAAWDNMGT
MYMGGIGSLK QDATVAYAFW QKAADMGSMT SQTYLGAKLN ADHDEPPAFW GNRSTGLKML
ECAFAQGSGQ AALTLGATLN VVEKDHVRAL RVLHEGVKLG NEKSANYLVG AFGRGAPLVD
GVVDKSRADR YSALGDALYH NPDLRFPNLD KVLPLPPAPL PQWDMSAPKT LIDAAKAVVP
GASSPPQQAP APTSQRTSQL ESAERGMPAT HTRVAQGIAR EADLPTPPVR CNGAGRCLVT
GIWQARVPDD HALAASFNQW HRQAYVMEGQ PFPDPREQYL DIDPAQVIWT WWNQANHLGF
ARIPQVSVGN PPLAG