Gene Vapar_3780 details


Gene Information

Locus tag: Vapar_3780
Symbol: (not given)
ID: 7970940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand: (not given)
Start bp: 3995831
End bp: 3998431
Gene Length: 2601 bp
Protein Length: 866 aa
Translation table: 11
GC content: 71%
IMG OID: 644794367
Product: pentapeptide repeat protein
Protein accession: YP_002945662
Protein GI: 239816752
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID: (not given)
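
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3995831
end_bp = 3998431
gene_length_bp = 2601
protein_length_aa = 866

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop
# and is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(remainder == 0)                            # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under those assumptions the three fields agree: 2601 bp is 867 codons, which is 866 amino acids plus one stop codon.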


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: (not given)
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACCG TCAAGCCGCT GCGCCTGGGC ATTCTCACCC GGCCATACCG CTGGCAGGGC 
GTCGACCAGC TCGGCGTGGC CGTCACGGCG TTCGCCAGCC TCGAGGCGGA ACCGAAGCTC
ATGGCCGACC AGGAGCTGTG GCAGACCGTC GGCGAAGAGA TCGGCGGCCT GGGCGTGTTC
GACGCGGGCG TGCCCAAGTG CATGCCGGAG GTGCTCGCGA GCGGCCATGG CTACACCCAC
CACCAACAAG ACAAGACCGC CTGCGCCGTG CGCCTGCGCG TCGCCGATCT CGAGAAGTCC
CTGCACGTGT CGGGCGACCG CTACTGGCTC GACGGCCGCC TGACCGCACC GCAGCCCTTC
GAAGCCATGC CGCTGGACTG GGCGCACGCC TACGGCGGGC CGGGCGTGCC GGAGAACCCG
GCAGGCATCG GCGCGTTCGA CGAGCTCATC AACGGCGTCC GCACGCGCCG CGCACCCAAC
GTGGAGGCCG TTCATGCCCG CGTGCGCTCG CGCGGGCAGG CGGTGGCGCC CGCGAGCTTC
GGCGCCGTGC CCATCGACAG CCCGCAGCGC ATGGCCCTGA TGGGCAGCAA GTTCGGGCAG
CACTGGCTGG AGCACCACTT CCCGGGGTTC GCGCAGGACA TGGACTGGCG CTTCTTCAAC
GCCGCGCCCG AAGACCAGCG CTGGCCCGCG CGCCAGGAGC TCCCGCCCGG CGCGCCCTAC
GAAATTCTCA ACATGCACCC GGAAAAGCCG GTGCTGAGCG GCCGCCTGCC CGACTGGCGC
GCACGCTGCT TCGCAAGCTT CGACAAGGAA GGCCGCGGCC TGCACGAGAT CGGCCTGCGC
CTGACCACGG CCTGGTTCTT CCCGCACCGC GAGCGCGTGG CGCTGATCTG GCACGGCGTG
CGGCCGGTGC GCGAGGACGA TGCGGCAGAC GTGCGGCACA TCATGCCGGC ACTCGAGCTG
CCGCAGGAGC CGCGCGAGCT CGCGCATTAC CAATCCGTGC TGCTGCAGCG CCTGGACACC
GCGCGCGGCG GCCTGCTCGC CTTTCGCGAC AGCGACCTCG CACCCAAGGC GGTCCTCGGG
CGCTGGGGCA TCCTCGACGC GCCCGACCCG ATGGCGCGGC CGCTGCTGCG CAACCTGCGT
GCCGGCCAGC AGCGCGACCA CGAAAGCCGC CGCGCCGAAC TCGTCGCGCT GGGCGTCGAT
CCCGACAAGT ACCTGCCCGC GCCCCAGCCC CCGGCCGCGC CGCTGGAGTT CGACGACGTG
CCCGAGTACT TCGAGCGCAT GCAAGGGGAA ATGATGGAAG CCCGAAAAAA CCTGCAAGCC
CAGGGCGAAC AGATGCGCCG CAGGCAGGAG GAAGAACTCG ACCCCGCGCT GCTGCAGAAA
AGCCGCGAGC AGCCGCAGCG CTTCGATCCC GACGCGCTGA TACGGCAGCT CGAACAGCTT
GCCGCCGCGC CACCCGCCGG GACCCCCGCA AGCCCCCCGC CCACGCTCAT CTCCCTGGGC
ACCAAGGATC GGATGACCGC GCAGATCCGG CAGGGCTACC TCCACGGCGC CCACCTGTCC
GACGCCGCGC CGCCCATGCC CTCGTTCCGC GCCGCGAAGA TCCGGCGGCG GCTGGCCGAG
GCCGCGCCCG GGGCGCGCAA TTTCTCGGGC ATGCGGCTCG TCGGGGCCGA TCTCTCGGAC
ATGGACCTGC GCGGGGCCGA CTTCTCCGGC GCCGCACTCG AAGACGCCAA CCTCGACAAC
GCGCAACTGT CGGACGCCAA CTTCAACGGC GCCGTGCTGG CACGGGCCCG GCTCTCGCGC
ACCTCGCTGG CCAGTGCGAC CTTCCGCAAC GCCAACCTCG GCGGCGCGCA CTGCGAGTTC
GCCGATTTCT CCGGCGCCGA CCTGAGCAGC GCCAACTGCG AGAAGACGCG CTTCGCGTCC
TGCAGCATGG CCAACACCGT GCTCGACCAG ACGCGCTTCA CCGCAAGCGA GATGAGCCAT
TGCGACTTTC GCGGCTCCGA CTGGCACCAG GTCTTCCTGA CCAAGCTGCG CATGAGCGGC
ATGGCCTTCG ACGGCGCCTC CTTCCAGCAG GTGGTATGGC TCGAATGCAC CCTGGCGGAT
GTGCGCTTTG CCAATGCGTC CCTGGTGCGC TGCAGCTTCG TCACGAGCGA CTGCAGCCGG
TCGGTCGATT TCTCCGACGC CCGGCTCGAT GCCTGCAGCT TCGCGCACGG CAGCACGCTG
GCCGGCGCGG TATTGCGCCG CGCCGCGCTC AAGCAATGCG GCCTGCGCAC GACGCCGCTG
CAGCAGGCGG ACCTGCGCGA AGCGCGCCTG GACAACTGCG ACTTTTCGGA ATGCGCGCTG
CAGGGCGCCA AGCTCGAGCG GCTCGTCGCG GGCGAGAGCC TCTTCGTGCG CGCCGACCTC
ACCGGCGCCA GCTTGCGCGG CGCCAACCTC ATCGACGCCA ACTTCTCGAA GGCCGTCTTC
GTGCAGGCCG ACCTGAGCGG CGCCAACCTG TTTCGCACCG ACGTGTCGCA AAGCCTGATC
GACGGCAGCA CCCACCTGCT CGGCGCCTAC ACACGGCACG CCAAGACCTG GCCCGCACGG
CGCGCCGCAG CGCCCGAATG A
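
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of that cleanup and of a GC-content calculation like the one behind the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample, so its value need not equal the 71% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block layout; uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short sample.
first_line = "ATGAAAACCG TCAAGCCGCT GCGCCTGGGC ATTCTCACCC GGCCATACCG CTGGCAGGGC"
sample = clean_sequence(first_line)

print(len(sample))
print(f"{gc_fraction(sample):.0%}")
```

To check the reported figure, paste the complete gene sequence into `clean_sequence` and pass the result to `gc_fraction`.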
 
Protein sequence
MKTVKPLRLG ILTRPYRWQG VDQLGVAVTA FASLEAEPKL MADQELWQTV GEEIGGLGVF 
DAGVPKCMPE VLASGHGYTH HQQDKTACAV RLRVADLEKS LHVSGDRYWL DGRLTAPQPF
EAMPLDWAHA YGGPGVPENP AGIGAFDELI NGVRTRRAPN VEAVHARVRS RGQAVAPASF
GAVPIDSPQR MALMGSKFGQ HWLEHHFPGF AQDMDWRFFN AAPEDQRWPA RQELPPGAPY
EILNMHPEKP VLSGRLPDWR ARCFASFDKE GRGLHEIGLR LTTAWFFPHR ERVALIWHGV
RPVREDDAAD VRHIMPALEL PQEPRELAHY QSVLLQRLDT ARGGLLAFRD SDLAPKAVLG
RWGILDAPDP MARPLLRNLR AGQQRDHESR RAELVALGVD PDKYLPAPQP PAAPLEFDDV
PEYFERMQGE MMEARKNLQA QGEQMRRRQE EELDPALLQK SREQPQRFDP DALIRQLEQL
AAAPPAGTPA SPPPTLISLG TKDRMTAQIR QGYLHGAHLS DAAPPMPSFR AAKIRRRLAE
AAPGARNFSG MRLVGADLSD MDLRGADFSG AALEDANLDN AQLSDANFNG AVLARARLSR
TSLASATFRN ANLGGAHCEF ADFSGADLSS ANCEKTRFAS CSMANTVLDQ TRFTASEMSH
CDFRGSDWHQ VFLTKLRMSG MAFDGASFQQ VVWLECTLAD VRFANASLVR CSFVTSDCSR
SVDFSDARLD ACSFAHGSTL AGAVLRRAAL KQCGLRTTPL QQADLREARL DNCDFSECAL
QGAKLERLVA GESLFVRADL TGASLRGANL IDANFSKAVF VQADLSGANL FRTDVSQSLI
DGSTHLLGAY TRHAKTWPAR RAAAPE
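
The protein sequence above is the translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below builds the codon table from the compact NCBI-style amino acid string (table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may start translation) and translates the first line of the gene sequence. The function names are hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove block-layout whitespace and uppercase the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGAAAACCG TCAAGCCGCT GCGCCTGGGC ATTCTCACCC GGCCATACCG CTGGCAGGGC"
protein_start = "MKTVKPLRLG ILTRPYRWQG"

translated = translate(clean_sequence(gene_start))
print(translated == clean_sequence(protein_start))
```

The same two calls applied to the full gene sequence should yield a protein that can be compared against the full protein sequence listed above.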