Gene Saro_2328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2328 
Symbol 
ID3915673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2473346 
End bp2474914 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content69% 
IMG OID640445084 
Producthypothetical protein 
Protein accessionYP_497599 
Protein GI87200342 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.338173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCCGC CTGATCTCGC GCTCAGGGCC CCGCTCGGCC GCGCCTGGAC CGCGCTGCCC 
GCCCAGTGGC GGCGTGCGCT CGGCCTGCTC GCGCTGGCGT GGCTCGGCAA CGTATTGCTC
TTTGCGGCCG ACTGGCGCGC GATGTTCCTG CAATGGTGGG ACAGCTCGAC ATACAATCAC
GTCCTGCTGA TCCCCTTCAT CCTCGGCTGG CTCGTCTCGC TGCGCTGGCG CGAGGTGGTG
AAGGTCGCGC CGCAGGGCTG GTGGCCAGGG CTGGTCCTCT TCGCGGGAGC CGGCTTCCTC
TGGCTGCTTG GCGATTTCGC GGGCCTTTCG CTTGCCACGC AGCTTGGCGT GGTGCTGATG
GCGCAGGCGA GCGCGCTGAC GCTGCTGGGG CCGCGCGTTT CCGCCGCGCT GGCGTTCCCG
CTCGCCTACA TGCTTTTCCT CGTCCCCGCC GGCGACGAGC TGATCCCCAC GCTGCAGACG
ATCACTGCGC GAATCACCAT GGCGCTGCTC GACCTCAGCC AGGTCCCGGC GCACATCGAG
GGCGTGTTCA TCACCACCCC CGGCGGCTAT TTCGAGGTCG CGGAGGCGTG CTCCGGTGTG
AAGTTCCTGA TCGCGATGGT CGCCTATGGC GCGCTGGTCG CGAACGTCTG CTTTGCCACC
TGGACCCGCC GTGCGGCGTT CATGGCGCTG AGCGTGGCCA TGCCGATCCT CGCCAACGGC
GTGCGCGCAT GGGGCACGAT CTTCATCGCC GAACACCACG GCATCGAATT CGCGGCGGGC
TTCGACCACG TGTTCTACGG GTGGATATTC TTCGCCATGG TCATGGCGAT GGTCATGGTC
CTCGCATGGC GCTTCTTCGA CCGCGCGATC GACGACCGCA TGATCGATCC CGACGCCATC
GCCGCAAGCC CTGTCCTTGG CCGCCTGTCC GGCTTTGCCA TGGCGCAACC CCGGGCCCTT
GCGGCCGCAG CCGCCATCGC CCTGCTGTTC GCAGGCTGGG GCGCCTGGGC CAACAGCCTC
GAGGCCGCGA TCCCCGCTCG CATAGATTTC CCCGGTGTGC CGGGCTGGCA GCAAGTCGAC
TATGCCCCCC AATCGCACTG GAAACCGCTC CACGGAGGCG CTTCCCACGT CCTCCTGGGC
CGCTTCCGCG ACGGTGCGGG CCACACCGTC GACGTTTCCT ACGCGCTCTA CGCCATGCAG
GCAGACGGGC ACGAGGCGGG AGGTTTCGGG CAGGGAGCCA TCCCGCTTGG CGGTGGCTGG
GCCTGGGAGC GCTCTGCCGC CCCGCTCGCG GGAGGCCACG CCGACCGCAT CCAGACAGCC
GGTCCCGTCC ATCGCCTGGC AGAGACCTTC TACCGCTCGG GCAGCCTCTT CACCGGTAGC
AACACCCGCC TGAAACTGCG GAATATCCTC GACCGTCTGC TGTTGCGGGA GCGGACCACG
GCCACGCTCA TCCTCTCCGC CGAAGACGAC ATTCCCGGCC AGCCGTCGGC CGAACAGTCC
ATGCGCGCCT TCCTGTCGGC CATCGGTCCG GTCGATGCGT GGATGGACCG CGCTGCCTTG
CCCCGCTAG
 
Protein sequence
MSPPDLALRA PLGRAWTALP AQWRRALGLL ALAWLGNVLL FAADWRAMFL QWWDSSTYNH 
VLLIPFILGW LVSLRWREVV KVAPQGWWPG LVLFAGAGFL WLLGDFAGLS LATQLGVVLM
AQASALTLLG PRVSAALAFP LAYMLFLVPA GDELIPTLQT ITARITMALL DLSQVPAHIE
GVFITTPGGY FEVAEACSGV KFLIAMVAYG ALVANVCFAT WTRRAAFMAL SVAMPILANG
VRAWGTIFIA EHHGIEFAAG FDHVFYGWIF FAMVMAMVMV LAWRFFDRAI DDRMIDPDAI
AASPVLGRLS GFAMAQPRAL AAAAAIALLF AGWGAWANSL EAAIPARIDF PGVPGWQQVD
YAPQSHWKPL HGGASHVLLG RFRDGAGHTV DVSYALYAMQ ADGHEAGGFG QGAIPLGGGW
AWERSAAPLA GGHADRIQTA GPVHRLAETF YRSGSLFTGS NTRLKLRNIL DRLLLRERTT
ATLILSAEDD IPGQPSAEQS MRAFLSAIGP VDAWMDRAAL PR