Gene Vapar_3863 details


Gene Information

Locus tag: Vapar_3863
Symbol:
ID: 7969720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 4091771
End bp: 4093090
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 62%
IMG OID: 644794449
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_002945743
Protein GI: 239816833
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
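
The start, end, gene length and protein length entries above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information entries above.
start_bp = 4091771
end_bp = 4093090
gene_length_bp = 1320
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# A CDS of N bases holds N/3 codons; the final codon is assumed to be the stop.
codons = span // 3

print(span, span % 3, codons - 1)  # prints "1320 0 439"
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and leaves 439 coding codons, which agrees with the listed protein length.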


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.986909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGACA GCAAGCTGCC GCGCTTCACG AGGCTCGACA AGTTCGATCC GCACCGGAAG 
GACTTCGTGT GCAAGCATGA GGCCGAAGCC AACCCGCCCA TCACGGCAGA AGCCGAAGCA
CTGTTCCAGC AGGCGCTGGC GCTGGTGAGC TACGAGATGT GGTCGGAGAA TCGCAACTAC
GCCAGGGCGG CACAGCTGTA CGAGCAGGCA ATGAAGCTGG GCCACTGGAA GGCGCAGTTC
AACCTGGCGG GGCTGTATCT GCAAGGTCTA GGCATCGAGC AAGACTCCGA GAAAGCCATC
GAGTTGACCG AAGACTTGAT GCGCAAGGGC GTGCCCGCGG CCTGGGACAA CATGGGCACG
ATGTACATGG GCGGCATTGG GTCGCTCAAG CAGGATGCGA CCGTGGCGTA CGCGTTCTGG
CAGAAGGCTG CAGACATGGG GAGCATGGCA TCGCAGGCGT ACATCGGAGC CAAGTTGAAG
GCCACGCACG ACGAACCACC GACCTTCTGG GGAAATCGCC TTGTCGGGTT GAAGATGCTT
GAATGCGCGT TCGCCCAAGG CTCTGCGAAG GGCGCCTATG AATTGGCGAT CACACTCGTG
GGCAACAATC CCGCGCTTCA AGAGAATGAC GAGCGGGCAC TCAGAGTCTT TCACGAAGGT
GTCAAGCTTG GCAGTCAACA AAGCGCTGGC TATTTGAGTT CTTCATTTCG ACACGGAGAA
AAGCCCGTAC AAGGAGGTCC GGATACATCA AGGGCTGATC GCTACCACGC TCTTGCCAAC
GCGCTTTACT ACAACCCCGA CCTGCGCTTC CCCAACCTCG ACAAGGTGCT TCCCCTTCCC
CCGGCACAGT TGCCCCAGTG GGACATGAGC GCCCCCAAGA CGCTCATCGA TGCCGCCAAG
GCGGTGGTGC CGCCTGCCTC TTCGCCACCG CAGCAAGCAC CGGCATCAAC ATCCCAACGC
ACCAGTCAGT TCGAAAGCGC CGAGCGAGGC ATGCTGGCCA CTCACACGCG CGTGGCCCAG
GGCATCGCAC GAGAAGCCGA CTTGCCGAAA CCACTGGTTC GATGCAGCGG CGCCGGGCGC
TGCCTTGTCA CGGGCATCTG GCAGGCACGC GTGCCCGACG ACCACGCGCT CGCCGCATCG
TTCAACCAGT GGCATCGCCA GTCCTATGTG ATGGAGGGCC AGCCCTTCCC CGATCCGCGC
GAACAGCACC TGGACATCGA TCCGGCGCAG GTCATTTGGA CCTGGTGGAA CCAGGCCAAC
CATCTGGGCT TCGCCAGGAT TCCGCAGGTC AGCGTGGGCA ATCCGCCCGT CGCGGGGTAA
 
Protein sequence
MSDSKLPRFT RLDKFDPHRK DFVCKHEAEA NPPITAEAEA LFQQALALVS YEMWSENRNY 
ARAAQLYEQA MKLGHWKAQF NLAGLYLQGL GIEQDSEKAI ELTEDLMRKG VPAAWDNMGT
MYMGGIGSLK QDATVAYAFW QKAADMGSMA SQAYIGAKLK ATHDEPPTFW GNRLVGLKML
ECAFAQGSAK GAYELAITLV GNNPALQEND ERALRVFHEG VKLGSQQSAG YLSSSFRHGE
KPVQGGPDTS RADRYHALAN ALYYNPDLRF PNLDKVLPLP PAQLPQWDMS APKTLIDAAK
AVVPPASSPP QQAPASTSQR TSQFESAERG MLATHTRVAQ GIAREADLPK PLVRCSGAGR
CLVTGIWQAR VPDDHALAAS FNQWHRQSYV MEGQPFPDPR EQHLDIDPAQ VIWTWWNQAN
HLGFARIPQV SVGNPPVAG
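
The protein sequence can be reproduced from the gene sequence with the codon assignments of translation table 11. The following is a minimal sketch: `translate` is a hypothetical helper, not a tool used by this database. It strips the 10-base block spacing, reads codons in frame from the first base, and stops at the first stop codon. Table 11 also allows alternative start codons, which this sketch ignores because the gene begins with ATG.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):  # trailing partial codons are ignored
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGTCGGACA GCAAGCTGCC"))  # prints "MSDSKL"
```

Passing the full gene sequence, with its spacing left in place, should give a string that can be compared with the protein sequence above once the spaces in the latter are removed.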