Gene Saro_3167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3167 
Symbol 
ID3918209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3381637 
End bp3382617 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID640445951 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_498436 
Protein GI87201179 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1082] Sugar phosphate isomerases/epimerases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGACG CAAAGCTTGA TCGCCGCTCG CTGATCGCGG CGCTGGCCGC CACCGGAGTT 
GCCGCCATGA CCGGAACCGA CGCCATCGCG CGCGCTGCCG CCCGCAAGCC CTTCTTCCAG
CGCATCGGCA AGCCTATCGG CTTGCAGCTC TACACCCTGG GCGACAAGCC GACGCAGGAC
CTCGACGGTA CGCTGGCGCG GCTTGCGGCC ATCGGCTTTA CCGACATCGA GTTGCCCAAT
TTCTACAATC GCACTCCCGC AGAGCTGCGT GCCGCTGCCG ACAAGGCTGG GGTCCGCTAC
AGTTCGATCC ACATGAACAT GCCGGGCCCG TTCACCGGCG GCGCACTCAG CCTGATGAGC
GCTCCCCAGG AAATCGCCGA CGGGCTGAAC ACGCTCGGCA TCCATCAGGT GTTCCTGCCG
CTTTGCCCCC TGCCCGAAGG CTTTTCGGTG CCCGAAGGAA AGAGCCCGCA GGTGGCGATC
GGCGACGCCT TGCGGGCAGC CGGCGCGGAC CACTGGAAGC GCACCGCGGC CCTGCTGAAC
GAACGGGGCG CAGCCTTGCG GCCCTTCGGC ATCCGGCTCG GCTACCACAA CCACAACATG
GAGTTCGCCC CGCTCGATGG CGGGGCGACC GGGTGGGACA TCCTGATCCG CGAGACCGAT
CCCGCGCTCG TCAATTTCGA ACTGGACCTG GGCTGGACTT CGGCCGCGGG ACACGATCCC
GTCGTCGAAC TGGGCAGGCT CAAGGGGCGG GTAAAGGCGG TGCACGTCAA GGACATCAAG
GCATCGACGA AAACCAACTT CGTCATGGGC CAGGATCCTA CCGAGGTGGG TTCGGGGCGC
CTGCAATGGG CGAAGATCCT GCCCGCTGCC CTCGCCGCGG GGGTCGAGCA CTTCTATGTC
GAGCAGGAAC CGCCATTCAC GATGGACCGC CTCGACGCGG TAACGAAAAG CCACGCATTC
CTGTCGCGCT TCGTGGCCTG A
 
Protein sequence
MHDAKLDRRS LIAALAATGV AAMTGTDAIA RAAARKPFFQ RIGKPIGLQL YTLGDKPTQD 
LDGTLARLAA IGFTDIELPN FYNRTPAELR AAADKAGVRY SSIHMNMPGP FTGGALSLMS
APQEIADGLN TLGIHQVFLP LCPLPEGFSV PEGKSPQVAI GDALRAAGAD HWKRTAALLN
ERGAALRPFG IRLGYHNHNM EFAPLDGGAT GWDILIRETD PALVNFELDL GWTSAAGHDP
VVELGRLKGR VKAVHVKDIK ASTKTNFVMG QDPTEVGSGR LQWAKILPAA LAAGVEHFYV
EQEPPFTMDR LDAVTKSHAF LSRFVA