Gene Saro_3239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3239 
Symbol 
ID3917497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3460109 
End bp3461353 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content67% 
IMG OID640446023 
Producthypothetical protein 
Protein accessionYP_498508 
Protein GI87201251 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGGC ACCTTGAAAT GACCGCACGG AATGTGACGG AAGCGCCGCA CGAGACGTTG 
TTGCTGCGCA TTTGCGCTGC GCCGGATGCT GACCACGAGA TGGCGTTTCG GGGGCTGGAC
GAGGAGGGCT GGAGCGACCT TGCCGCGCTC GCGGAGGACA AGCGCGCTGC GCCCCTGGTC
CGCCGGAGCA TCGCGATCGC GGGCATCCAG CCGATCGTTC CTGCGTCTGC CTTGCAGGAG
ATCGACCGGG CCTGCCAGTG GCACGCGCTC TATGGCCTGC GTCAGGCGGT CGCGCTGAAG
CGGCTCATCG GGGTTCTGGC GCAGGGTGGA TTCCATCCGA TCGTGCTCAA GGGCCTCGGT
CTGGCGCACC GCGACTATCC CGACCAGGCA TTGCGGCCGC TGCGCGACGT GGACTTGCTG
CTGACGCCGG ACGAAGCACC CGCGGCGCAA GACCTGCTCC TGCGCACCGA AGGATACCGG
CTGGCGCCCT GGGCGGGGAC CTATGGCGTG GAGTACGGGC ACCAGATGCC CGAACTGCAG
GACGTGGAGT TCGAACTTAC CATCGAGGTC CACCACAGGA TCAACGCGCG GGGCTGGGCG
CAGGAGCCCT TGTTGCTCGA GTTGATCCGC GGCGAGGCGA CCGAACTGAC CCTTCTCGGC
GCACAGGTCC GCGTTCCTTC ATCGCGTGCG AACTTTCTCC ACCTGGTCGA ACATGCGACG
CTCCACCATG CTTTCGAGAA TGGGCCGCTG GTGCTGGCCG ATCTGCATTT TCTTGTGCAG
CGCAACGAAC TGGACTGGGG CTGGATCGAG GCAGAGGCGG CGCGGCTGGG CCTCGCCAAT
TCGCTTCGCC TGCTGGCGAC GGTGGCTGCG GAGCTAGGCG CGGGCTGGCC GCCCGCGCAC
CTGGCCAACA AGGAATGCGT GCCGGACCTG CACCTGGCAT CGGCGCATGT CGCGATGCTC
CAGGACAAGG AGGCATCCGA GCGCAACAAG ATGATGCGGC GGCTGGAGGC GGAAACCAGC
GGTGACAGCG GCTGGCGGGC GGCTGTTGCG CGGGCATTCC GGCCCAATCC CCACCAGCTT
GCCGCCTTCG CCGGATCGCG GCACGACGAC TGGCGGCGGT GGCTGGGCTA TCCGGCGTGG
CTGTTCAACC GCGCGCGGCG CTATCTGGTT GCCTCGCGGG ACGAGGTCGT GAGATCCGGA
GCTGAGCGCG AGGCGGAAAT GGTCAACTGG CTCCGTCTCG GCTGA
 
Protein sequence
MPRHLEMTAR NVTEAPHETL LLRICAAPDA DHEMAFRGLD EEGWSDLAAL AEDKRAAPLV 
RRSIAIAGIQ PIVPASALQE IDRACQWHAL YGLRQAVALK RLIGVLAQGG FHPIVLKGLG
LAHRDYPDQA LRPLRDVDLL LTPDEAPAAQ DLLLRTEGYR LAPWAGTYGV EYGHQMPELQ
DVEFELTIEV HHRINARGWA QEPLLLELIR GEATELTLLG AQVRVPSSRA NFLHLVEHAT
LHHAFENGPL VLADLHFLVQ RNELDWGWIE AEAARLGLAN SLRLLATVAA ELGAGWPPAH
LANKECVPDL HLASAHVAML QDKEASERNK MMRRLEAETS GDSGWRAAVA RAFRPNPHQL
AAFAGSRHDD WRRWLGYPAW LFNRARRYLV ASRDEVVRSG AEREAEMVNW LRLG