Gene Vapar_3869 details


Gene Information

Locus tag: Vapar_3869
Symbol:
ID: 7969726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 4098385
End bp: 4099398
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 65%
IMG OID: 644794455
Product: NMT1/THI5 like domain protein
Protein accession: YP_002945749
Protein GI: 239816839
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
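
The Start bp, End bp, Gene Length and Protein Length fields above can be checked against each other with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 4098385
end_bp = 4099398

# Assumption: Start bp and End bp are 1-based and both inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1014 337
```

Under these assumptions the computed values agree with the listed Gene Length (1014 bp) and Protein Length (337 aa).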


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.478553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCTT CTTTCTCACG CCCCGTATCG CGCCGGGGTG TCCTGCGCGG AGGCGCCGCC 
GCTGCCGTGG TCGCCTCTGG CGGCCTGATC GCCTCGCAGG CCTTCTCGCA GCAGGCGCGC
AAGCTCACCT TTGCATGGAA CGCCGCCGCG TTCTGCCTCT CGCCCGTCGT CGTCGCGCAG
GAGCGCGGCT ACTTCGAGCG CAACGGCCTG CAGGTGGACC TGATCAACTA CACCGGTTCC
ACCGACCAGC TGCTGGAGTC GCTGGCCACG GCCAAGGCCG ATGCGGCGGT GGGCATGATC
CACCGCTGGC TCAAGCCGCT GGAGTCGGGC TTCGACGTGA AGATCGTCGG CAGCTCGCAC
GGCGGCTGCG TGCGGCTGGT GGGTGCGAAG AGTGCGGGCG CGACCAGCCT TGCGAGCCTC
AAGGGCAAGA TCATCGGCGT GTCGGACATC GCGAGCCCCG GCAAGAACTT CTTCTCGATC
CTGCTCGCGA AGAACGGCAT CGATGCCGAC AGGGACGTGA CCTGGCGCCA GTACCCGGCC
GACCTGCTCG ACATTGCGGT GCAGAAAGGC GAGATCCACG CCATTGCCGA TGGCGACCCG
AACGTTTACC TGATCGAAAA GCGCAACAAG GACGCCTTCG TGGAGATTGC GAGCAACCTC
TCGGGCGAAT ACAAGGACAA GGTCTGCTGC ATCGTCGGCG CGCGCGGCGA ACTCGTTCGC
AAGGACAAGC CGACCGTCGC GGCCCTCGTG CGCGCCATCG CGCAAGCCTC CGACTACGTG
GCCGAGAACC CGAACGAATC GGCCAAGCTG TTTGCGAAGT ATTCGCCCAA GGTGCCGGTC
GAAGACCTGC GCGCGCTGCT TGGCACGCTC ACGCACAACC ACCATCCGCT CGGCAGGAAC
CTGCGCGACG AGGTGGAGTT CTATGCGCGA GATTTCCGCG GCGTTGGCGT GCTCAAGAAG
ACCACCGATC CGGTGCGCTT TGCCGAGCAC GTCTCTTTCG ATCCACTCGC ATGA
 
Protein sequence
MTASFSRPVS RRGVLRGGAA AAVVASGGLI ASQAFSQQAR KLTFAWNAAA FCLSPVVVAQ 
ERGYFERNGL QVDLINYTGS TDQLLESLAT AKADAAVGMI HRWLKPLESG FDVKIVGSSH
GGCVRLVGAK SAGATSLASL KGKIIGVSDI ASPGKNFFSI LLAKNGIDAD RDVTWRQYPA
DLLDIAVQKG EIHAIADGDP NVYLIEKRNK DAFVEIASNL SGEYKDKVCC IVGARGELVR
KDKPTVAALV RAIAQASDYV AENPNESAKL FAKYSPKVPV EDLRALLGTL THNHHPLGRN
LRDEVEFYAR DFRGVGVLKK TTDPVRFAEH VSFDPLA
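
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips that spacing, computes a GC fraction, and translates codons until the first stop codon. The helper names (clean, gc_fraction, translate) are hypothetical and not part of the source database. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs in which codons may act as start codons, and this gene begins with ATG, so that difference is not modelled here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACCGCTT CTTTCTCACG CCCCGTATCG"
print(translate(first_blocks))  # MTASFSRPVS
```

The ten residues produced from the first 30 bases match the first block of the listed protein sequence. Passing the full gene sequence to translate and gc_fraction is one way to compare the result with the listed protein sequence and GC content.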