Gene Vapar_3969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3969 
Symbol 
ID7974534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4212836 
End bp4213876 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content66% 
IMG OID644794555 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002945849 
Protein GI239816939 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0791999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGTC GTCGATTCTT TTTTCTTGCG GCCAGTGCCG CCGCCAGCAC CCTTGTCGCT 
CCCGCCGTCT TCGCCCAGGG CGGCGGCAAG CCGACCCCGC TCAAGTTCAC GCTCGACTTC
CGCATCAACG GCCAGACCGC GCCCTTCTTC CTCGCGCACA GCAAGGGCTA CTACCGGGAC
GAAGGCCTGG ACGTGGCCAT CGACACCGGC GCCGGCTCGG TCGCCTCGAT CACGCGCATC
GCGAGCGGCG TCTACCAGAT GGGCCTGGGC GACATCAGCT CGCTGGTCGA GTTCAATGCG
CAGAACCCCG GCACCCCGAT GGTGCAGGCG GTGTACCAGT ACTACAACCG CGCACCCTTC
GTGATCATCG GCCGCAAGGA CCGCGGCGTC ACGGCCGACT TCAAAAGCCT CGCAGGCAAG
AAGGTGGCCG CGGCGGCCGT CGAATCGACC CGCCGCGCAT GGCCGATGGT GGCGCGCAAG
CAGGGCATGC GCAGCGACGC CTTCCAGTGG CAGACCACCG ACTTCAGCGC GCGCGACAAC
GTGATGGTGC GCGGCGACGT CGATGCCGCC ACCTACTTTC ACGACTCTGC CATTTCGCTC
TTCGCGCGCA TGAAGGCGGA GGAACTGTCG GTGCTCAAAT ATGCGGACGC GGGCGTCAAC
CTGTACGGCA ACGCCATCCT CGCGAGCAGC AACCTCATTG CGCAGAACCC CAGGGTGGTT
GCGGCCTTCC TGCGCGCCAC CAACCGCGCC ATCGTCGAGA CCTTTGCCAA TCCGGCGCCC
AGCATTGCGG CCATGCGCCA GCGCGAACCG ATCCTCGATG AGAAAATGGA GCTTGAACGC
TGGGGTGTCA CGGCGCAATA TGTCGGTGCC GCCGACACGC GCGGCCACGG CCTCGGCGAC
ATCCGCAAGC TCACGCTCGA GCAGCAGGTC GACGAGGTCG CCGACGTATT CGGCCTCAAG
GTCAAGCCCT CGTCCGACGC CATCTTCAAC ACGTCGATGC TGCCATCGCG CAACGAACGC
ATGATTCCCA CCAAGGCATG A
 
Protein sequence
MQRRRFFFLA ASAAASTLVA PAVFAQGGGK PTPLKFTLDF RINGQTAPFF LAHSKGYYRD 
EGLDVAIDTG AGSVASITRI ASGVYQMGLG DISSLVEFNA QNPGTPMVQA VYQYYNRAPF
VIIGRKDRGV TADFKSLAGK KVAAAAVEST RRAWPMVARK QGMRSDAFQW QTTDFSARDN
VMVRGDVDAA TYFHDSAISL FARMKAEELS VLKYADAGVN LYGNAILASS NLIAQNPRVV
AAFLRATNRA IVETFANPAP SIAAMRQREP ILDEKMELER WGVTAQYVGA ADTRGHGLGD
IRKLTLEQQV DEVADVFGLK VKPSSDAIFN TSMLPSRNER MIPTKA