Gene Saro_3191 details


Gene Information

Locus tag: Saro_3191
Symbol:
ID: 3917449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3410715
End bp: 3411779
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 68%
IMG OID: 640445975
Product: GTP cyclohydrolase II
Protein accession: YP_498460
Protein GI: 87201203
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
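
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3410715
end_bp = 3411779
gene_length_bp = 1065
protein_length_aa = 354

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
coding_codons = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks print True: the span is 1065 bp, which is 355 codons, or 354 residues plus one stop codon.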


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGAAT CGCGCAACGT TGCAAGGGCG CTCGACGCGC TGCGCCACGG CTGGGCGATC 
CGTGTGACCG GCCCGGAGGG GGCGCTCGAC CTGCTCCCCG CCGAAACCGC CTTCGTGCAG
CCCGGCATCT ACGCGGCCCG ACTGCTCATC TCCGCCGCCC GGGCCGCCAC GTTGAAGCTT
GCCAACCAGC GCGACGCCGC GGTGCCCGAA GCGCCGGTGA TGATCCACGG CGCGGAGCCG
TTCAGCCTGT CCGCCGCGCG CAACCTTGCC GATCCGGCGC AGGACCTTGG CTCTCCCTTG
CGCGGCCCGT TCAAGGCCGA TGCCATCGAA GCACATGAGG CCGCCGTCGC CGCGATGGAC
ATGGCGCGCC TTGCCGGCAT CCTTCCGGCG TTCCTGATCT CGACAGGCGT GGAAATCGCG
GCGGAAGTCT CCACCGCCGA TCTTGCCGCG TTCAAGGACC CGCTGAACCT TTCGATACAG
GCCCGCGCGC GCCTGCCGGT CCACGCCTGC GAGCATGCGG AAATCATCGC CTTCCGTGCC
CGCGACGACC TGCGCGAACA TGTCGCGCTC GTGCTAGGCA CCCAGACCAG CGAACGCGAG
CCGCTGGTGC GCCTGCACAG CGAATGCCTG ACGGGCGACG TGCTGGGCAG CCTGAAGTGC
GATTGCGGCC CGCAGCTCGA CGCAGCGTTG GCGCGCATGG CCGAGGAGGC CAATGCGGGC
GGCTGGGGCA TACTGCTCTA TCTCAGGCAG GAAGGGCGGG GAATCGGCCT GATCAACAAG
CTGCGCGCCT ACGAATTGCA GGACCAGGGG TTCGACACGG TCGATGCCAA CGAGCGACTG
GGACTGCCGA GCGAGGCGCG CGACTTCCCG GTCGCGGCGC GCATGCTTGA CCTGCTGGGC
GTGCGCAGCC TGCGCCTGTT GACCAACAAT CCGCAGAAAG TGGCGACATT GCAGGCGCTT
GGGCTGGAGG TGACGGAGCG CGTGGCGCAC CAGTTGCCGT CCAATCCGCA CAACCAGCGC
TATCTCGACA CCAAGCGAGA CCGGACCGGC CACCTCTTGC GATAG
 
Protein sequence
MSESRNVARA LDALRHGWAI RVTGPEGALD LLPAETAFVQ PGIYAARLLI SAARAATLKL 
ANQRDAAVPE APVMIHGAEP FSLSAARNLA DPAQDLGSPL RGPFKADAIE AHEAAVAAMD
MARLAGILPA FLISTGVEIA AEVSTADLAA FKDPLNLSIQ ARARLPVHAC EHAEIIAFRA
RDDLREHVAL VLGTQTSERE PLVRLHSECL TGDVLGSLKC DCGPQLDAAL ARMAEEANAG
GWGILLYLRQ EGRGIGLINK LRAYELQDQG FDTVDANERL GLPSEARDFP VAARMLDLLG
VRSLRLLTNN PQKVATLQAL GLEVTERVAH QLPSNPHNQR YLDTKRDRTG HLLR
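
The gene sequence starts with GTG while the protein sequence starts with M. This is consistent with translation table 11, in which GTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. The function name `translate_cds` is hypothetical; the codon table and the start-codon set are written out from the NCBI definition of table 11.

```python
# Codon table for NCBI translation table 11 (amino acid assignments are the
# same as the standard code), indexed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons permitted by table 11; as the first codon they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above, copied with their spacing.
first_codons = "GTGAGCGAAT CGCGCAACGT TGCAAGGGCG"
print(translate_cds(first_codons))  # MSESRNVARA
```

Passing the full gene sequence block to the same function should give the protein sequence listed above, since internal codons such as TTG are translated normally (as L) and only the first codon gets the start-codon treatment.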