Gene Vapar_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_1901 
Symbol 
ID7971080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp2032452 
End bp2033513 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content67% 
IMG OID644792502 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002943816 
Protein GI239814906 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATCG AGACGCCCTC CTCTTCCCAT GCCAGCGCCC CGCCTTCGAG GCGCCTGGCG 
CTCAAGACCT TCGCTTCGCT CGCCTCCGTG GCGGCCATCG GCAGCCTCGG GCTTTCGGGC
TGCTCGCGCG AGGCCGGCGG CCCGGCCGTG GTGAAGAAGA CCGGCGACAA GGTGGCGTTC
AAGTACCCGA ACAACCCGTC GTTCGACCTG ATCTACCTGG CCGACGAGCT CGGCTACTTC
GACGGCACCA ACACCCGCCC CGAGTACGTC GGCAAGATCG CCGCGCCGCA GATCATTCCG
CTGGTGGGCA CGGGCGAGAT CGACTTCGGC AGCCGCATGG TGCCGCTGGT GATCTCGGCC
ATTGCGTCGG GCGCGGACCT CAAGGTGGTG GCCGCGGGCG GCAAGACGCT GCAGGAGGCG
CCGCACATGA AGTACTTCGT TCGCAAGGAC TCGGGCATCC GCAACCCGAA GGACCTGGAG
GGCAAGACCA TCGGCTTCAA CAGCTTCGGC GCCTGCGCCG AGTTCGTGAC CAAGAAGTAC
CTGCGCCAGC ACGGCGTGGA CGTGGCGAAG ATCAACTTCG TCGTGGTGCC CGACGAGCAG
GCCGAGCAGA CGCTGGTGAC CGGCAACACC GACCTCGCGA TCATCCACGC GCCTTTCTCG
GGCCGGGCCG ACAACGCCGA GCCCCTGGTG CGGCTGTGGA GCGACTACGA CCTCGACGGC
GGCCTTGGCG GCATGGCGCC GTACAGCGCG CATGGCCAGT TCATCCGCCA GCACCCGGAG
GCAGTGCGCG ACGTGGTGGC GGCGCTCGCC AAGGCCGGCA ACTGGGTCAA TGCCAACACC
GAGGAAGCGC GCAAGCTGGT GGCCAAGCGC ATCAGCATGG ACCTGAAGAA CGTCGACCGC
TACGCCTATG TCGACGACCT GGTGGTGACC GAGCCGCCGA TCCAGTACTA CATCGACATC
CTGCAGTCCG AGGGCAAGCT CGCCGCCGGC AAGGTGGCGG TGAAGGACGT CTACACGAAC
GAGTTCAATC CCTTCGCGCA GCAGCAGGCC GCGAAGTCCT GA
 
Protein sequence
MPIETPSSSH ASAPPSRRLA LKTFASLASV AAIGSLGLSG CSREAGGPAV VKKTGDKVAF 
KYPNNPSFDL IYLADELGYF DGTNTRPEYV GKIAAPQIIP LVGTGEIDFG SRMVPLVISA
IASGADLKVV AAGGKTLQEA PHMKYFVRKD SGIRNPKDLE GKTIGFNSFG ACAEFVTKKY
LRQHGVDVAK INFVVVPDEQ AEQTLVTGNT DLAIIHAPFS GRADNAEPLV RLWSDYDLDG
GLGGMAPYSA HGQFIRQHPE AVRDVVAALA KAGNWVNANT EEARKLVAKR ISMDLKNVDR
YAYVDDLVVT EPPIQYYIDI LQSEGKLAAG KVAVKDVYTN EFNPFAQQQA AKS