Gene Saro_3174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3174 
Symbol 
ID3918216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3389088 
End bp3390464 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content66% 
IMG OID640445958 
Producthypothetical protein 
Protein accessionYP_498443 
Protein GI87201186 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGTGC TCTTCGCCTT TTCGCTTAAA TCTGTTAGGG GCCTGTGCGT GACACCTTTT 
CCATGGTCTG ACGTCTTCGT GATCCTGGGC CTCGTCCTGC TCAATGGCCT GTTTTCCATG
TCGGAACTGG CGATCGTTTC GGCACGCCCG GCGCGCCTGA AGGTTGCCGC GGAGGAAGGC
AGCAAGGGCG CGAAGGTTGC GCTGGCGCTC GCAGCCGACC CCGGAAAATT TCTTTCGACC
GTACAGATCG GGATCACCCT CGTCGGCATC ATCGCAGGCG CCTATTCAGG GTCCAGCCTC
GGCGGGCCGA TGGCGGAGCG GCTCGCCGCA TGGGGTTTTC CGGCCCGTTA CGCGGACGAT
GCCGGGTTCG TCATCGTCAT CGCCTTCACC ACGTACCTGA GCCTCGTCGT CGGCGAACTC
GTACCCAAGC AGCTCGCGCT GCGTGCGGCG GAACCGATCG CCAAGATCGC GGCGCCCGCC
ATGGCGCTCA TGTCGAAGGT GACGGCCCCC TTCGTCTGGC TGCTCGACAA CTCGTCCAGC
CTGCTCATCC GCCTGCTCGG CCTCAAGCAG GGCACGGACC AGGAAGTGAC CGCCGAAGAA
CTCCACATGA TCTTCGCCGA GGCGACCCGC TCCGGCGTGA TCGAGGAGGA GGAGCGGGCG
CTGATGACGG GCATCATGCG CCTTGCGGAA CGCCCGGTGC GCGAAGTGAT GACGCCGCGA
ACCGAACTGC ACTGGATCGA GCGCAAGGCC CCCGAGGCCG AACTGCGCAG CGCGATCGAG
GACAGCCCGC ACTCGCTGCT GCTGGTGGCC GACGGGTCGG TCGACAAGAT CGTCGGCGTG
GTCAAGGTGC GCGACGTGCT GTCCACGCTG TTGCGGGGAC GCAAGGTCCA GCTCGGACGC
CTGATGAAGA AGCCGGCCAT CGTTCCGGAC CAGCTCGACA CGATGGACGC GCTCGGCATG
ATCCAGCAGG CCGAGGTCGC GATTGCGCTG GTCCACGACG AGTACGGCCA TCTCGAAGGC
ATCGTCACCC CGGCCGACCT GCTGTCCGCC ATCGCGGGCA ATTTCGTCGG CCACGCGGAC
GCGGGCGACG AACCCATGGT GGTCGAGCGC GAGGACGGTT CACTGCTGAT TTCGGGCGCC
CTGCCCGCCG ACGCCCTTTC CGACCGGCTG GGCCTCGACC TGCCCGACGA CCGTGAGTTC
GCGACGACGG CGGGCTACTG CCTTTCGGTG CTCAAGCGAC TGCCGAACGA GGGCGAGCAT
TTTCACGACC AGGGCTGGCG CTTCGAAGTG GTCGACATGG ACGGGCGCAA GATCGACAAG
CTGCTGGTCT GCCGCAGCAA GGCAATGCCC ATCGCCGCGC CGGAAGCCGA CGGCTGA
 
Protein sequence
MRVLFAFSLK SVRGLCVTPF PWSDVFVILG LVLLNGLFSM SELAIVSARP ARLKVAAEEG 
SKGAKVALAL AADPGKFLST VQIGITLVGI IAGAYSGSSL GGPMAERLAA WGFPARYADD
AGFVIVIAFT TYLSLVVGEL VPKQLALRAA EPIAKIAAPA MALMSKVTAP FVWLLDNSSS
LLIRLLGLKQ GTDQEVTAEE LHMIFAEATR SGVIEEEERA LMTGIMRLAE RPVREVMTPR
TELHWIERKA PEAELRSAIE DSPHSLLLVA DGSVDKIVGV VKVRDVLSTL LRGRKVQLGR
LMKKPAIVPD QLDTMDALGM IQQAEVAIAL VHDEYGHLEG IVTPADLLSA IAGNFVGHAD
AGDEPMVVER EDGSLLISGA LPADALSDRL GLDLPDDREF ATTAGYCLSV LKRLPNEGEH
FHDQGWRFEV VDMDGRKIDK LLVCRSKAMP IAAPEADG