Gene Saro_0246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0246 
Symbol 
ID3917597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp254975 
End bp256591 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content69% 
IMG OID640442973 
ProductType I secretion outer membrane protein, TolC 
Protein accessionYP_495528 
Protein GI87198271 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCCAA AGGCGAACGG CGGCGTGGCA TGGCTGGCTC TGCTTCTGGC GGAAACAGCC 
ACAGGCACCG TCGTATCCAC AGTCGCACCC ACCGTCGCCC GGGCGCAGGA GATCGCGAGC
ACTTTCGTCG AGGCGCCGGA ACTGCCACCC GAAACCCCGT CCGACATCGC CGACCAGTTG
CGGGAACCGA CCCCCGTGGC GGCCCTGCCG GACGCGCTGC GCCGTGCCTA CTGGTCCAAT
CCCAGCCTTC AGGCCCAGCG CGCATCGGTA CGCGGGGCGG ACTGGCGCAT TCCCCAGGCC
CGCGCCGCCT ATGGACCCAA GCTCAGCGCA TCGGGCACCT ATGGCTGGCA GCGCGACAAC
TTCGAGACCC CGGCGGGCGT CTATACCGCG TTCAATGGCT GGACGAGCAC GGCACAGGCG
ATCCTCACGC AGCCCCTGTT CACCTTCGGG CGAAATGTCG CGGCCGAACA GTTCGCGTCC
GCGCAAGTGG AATACCAGCG CAACGTCCTG CGCTCGACCG AGCAGCAGAC CATGCTCGAT
GCGATTGGCG CCTATGTCGG CGTGCTGCGC GACCGCGCCG CCGTGGGCAT CGCGCGCGAC
AACCTTGCGC TGCTCGAACA GGAGCTTTCC GACAACCAGG CCCGCTTCAA CGCGCGCGAG
GTGACGTCGA CCGACGTGCA GCAGGTGGAA ACCCGCGTCG ATCTGGGCCG GGCACAATTG
CTCGCCGCGC AGCGTGCCGC CGCCGGAAGT GAGGCCACGT TCCTTCGTAC CACCGGTGCG
CCGGCCGCCG AGAATGCCGC CGCGCCCAAT CCGTTGAGCC TGCCCGTGCG GACGATCGAG
GAGGCCTATC TCTTTGCCGA ACTGCACAAT CCCGTGCTGT TCGCGGCCCA GGCGCGCGAG
AAGGTTTCGC GGGCCCAGGC GGCCAGCGCG CGGGCGGACC TGATGCCGCG CGTCGACCTG
CGCGGATCAG CCGATTACGG CACGCTTTCG CCCTATTCGA ATGCGCTTCG CCAGAACACC
CTGCGCGGCG AGGTGGTGCT GAGCGCACCG CTGTTCGAAA GCGGCGTGCG CCGCGCGCGT
CTTGCCGAGG CGGATGCGGC GAACGATGCG GACTGGCGGC TCGTCGATGC GGCCATGCGC
GAAAACCGCG CCGCGATCGC CGATGCCTGG AGCGAATGGC AGGCGCAGAC CGGGGGCATC
GCCCGGCTTG GCGAAGCGGT CGAATCCGCG CGCAAGGCCT ATGACGGCGC GCTGCTCCAG
GAACGTGCCG GTCTGCGGAC CACGCTGGAC GTGCTCGATC TTGCACGGGA ACTGCTTTCG
GCCCGCAACG GCTACAACAA TGCCATTGCC GGGGCGACAA TCGCCAAGGC GCGCCTGCTC
TTCGCGATGG GCTCGCTCGA CTATGCGTGG CTGATGCCCG ACGAGGCGCG ATACGATGCG
GACGGGCACC TGCAGGACGT GCGTCACAAG GGTGACGTGC CGCTGCTCAC CCCGCTGTTC
CGCGCGCTCG ACAGCGTCGT CGCCGGCGGC GGCAAGCCGC GCCCGCTGCG CGATCCTTCC
GCCAAGGCGA CCACGTCGGG CTTCACCCTG ACCGAGCAGC CCGCGCCGGG CCAGTAG
 
Protein sequence
MRPKANGGVA WLALLLAETA TGTVVSTVAP TVARAQEIAS TFVEAPELPP ETPSDIADQL 
REPTPVAALP DALRRAYWSN PSLQAQRASV RGADWRIPQA RAAYGPKLSA SGTYGWQRDN
FETPAGVYTA FNGWTSTAQA ILTQPLFTFG RNVAAEQFAS AQVEYQRNVL RSTEQQTMLD
AIGAYVGVLR DRAAVGIARD NLALLEQELS DNQARFNARE VTSTDVQQVE TRVDLGRAQL
LAAQRAAAGS EATFLRTTGA PAAENAAAPN PLSLPVRTIE EAYLFAELHN PVLFAAQARE
KVSRAQAASA RADLMPRVDL RGSADYGTLS PYSNALRQNT LRGEVVLSAP LFESGVRRAR
LAEADAANDA DWRLVDAAMR ENRAAIADAW SEWQAQTGGI ARLGEAVESA RKAYDGALLQ
ERAGLRTTLD VLDLARELLS ARNGYNNAIA GATIAKARLL FAMGSLDYAW LMPDEARYDA
DGHLQDVRHK GDVPLLTPLF RALDSVVAGG GKPRPLRDPS AKATTSGFTL TEQPAPGQ