Gene Saro_0660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0660 
Symbol 
ID3918085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp694266 
End bp697238 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content62% 
IMG OID640443391 
Producthypothetical protein 
Protein accessionYP_495941 
Protein GI87198684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGCG TGAAGGCGGG CACTCTCGAA ATCGAGATCC TGACGAACAT GGCGCGCCTG 
CAGGAGGAAA TGCGCCAGAT CGAAAAGTCG GTCGGCAACA TGTCAACCGG GGTTGCGCGC
TCCACCAAGG CTGCAAACGA CAATATCGGG AGTATTGGCA AGACCAGCGG CCTAGCTAAG
CACCACGTGC AGAATCTGGC CTTTCAGTTC CAGGATCTCG GTATCCAGCT CACCGCCGCT
GCCCAGAGCA GCAAGCCCTT CCAAGGGGCG ATGATGGCGC TATTCCAGCA GGGCACGCAG
ATTCAAGGGG TGATGTCGCA GGCGGGCATA GGCGTTGGGG GCCTGATCGC CCAGATCGGG
CGCTTTGTCC TCGCCGCCGC GCCCGCCATC GCCGCCATCG GTGCAATCAG TGCCGCGGTT
GGTCTCGTCA CCAGCGAGAT CAACGAGAAC AGCAAGGTCA CGGTCACCTG GCAGGACACC
CTGCTGGGTA CTTATGACGC ACTGAAGAAG TATCTCACCG ACCAGCTCAC CGGTGCATTC
AAGGCATTCG GCCTCGAAAC TTCGAACGTC TGGAAGGACG TCGTCAACGC GGCGAAGTGG
GGAATCAACT GGATCATCGG CGCGATGACG GTGGTACCGC GGGTTCTTCG CGACACCTGG
AGCCTCATTC CCGCAGGCGT AGCGGACATC TTCGTGACGG CTGCCAATGG CGCCATCCGG
GCAGTCAACG CGATGGTCGC GAAGACCGTC GAGATCCTCA ATGGCTTCAT CAGCAGCGCC
AATGTCATCC TTGGCAAGGT CGGCCTAGAT CTTCCCAAGT TGGCAGCTCC GCAGATCGGG
GAGCTCCAGA ACAGCTATGC CGGAGCCGGT GCCAAGCTCG GAACCGCGTT CATGGGCGCA
ATCCGTGACA CCGTGACGCG TGACTACCTC GGAGACGCTG CCGCAGCGAT CAGTCCGTTT
GCCCAGGCGC GGGCTGTGGA GCGCATGAAG AAGGACGCGA AGAAGGCGGG CAAGGAGGCA
GGAAAAGCAC TCCGCGATGG CGCCGAAGAC GAGCTGAAGA AGCTCCTCGC ATCTCTGGAG
GGCACCTTCG CGATCTATCG GGACGCGGCC AAGACCACCG GGAAGATGCT GGACAAGCAG
CTCGATCGGG ACTGGCAACG GATGTTCGAC GAGATGGCGG ACAAGCAGCG CAAAGCCTCT
CAGGCGGCAG TTGATGCTGC CGATGCAAAT GCTGCCTGGA ACGACGAATT GCGCCGCACG
ATCGAGCTTC TCGATCAGAT TGGCGGGTTT GCAAGCATTC TGGCGAACAT CGGCGCAGTT
ATCGAGGGCT TCTCCAGCGG CGACTTCTCG GGTGTGCGCG GTCCGCTGGG CGGCCTGCTC
GGCCTGGTTG GCAGTACCGA TGAAGGCAAG CAGGCGATCA AGGACCTGGG TGTTGTGTTC
CGCGATGCGC TCGACGGCGT GTTCGGCGGA GAGGGCAGCT TCACGAAGGC ACTGAAGGCG
GCGAGCGTGG GTGTAGCTGC CGGGCAGATG GTGTTCGGTT CCAAGAACAG CGGCATCGGT
TCAGCAGTCG GCGGGGTGCT GGGAGAAGTT GCCGGCAAGG CTATCGGCAA ATCTGTTGGT
GGGCTGCTCG GCAAAGTTGG TGGTCCGCTT GGGTCCATTG TCGGTGGCAT TCTCGGCGGC
GCGCTTGGCA GCCTGTTCAA GACGGTAAAG ACGGGCTTTG CCGTCGTTTC CAACACGGGC
GTGACCTCAG GAGGCAGCAG CTCCGAACTT GCGTCGTCCT CGAAATCGAG CGGCACGGGA
ATCCAGGCCG CAATTCAGAA CATTGCCGAT CAGCTGGGCG GGTCGGTGGG GAACTACTCG
GTCTCGATCG GCAAGCGGTC TTCGGGCTGG ATCTCGGTCT CGGCGTCGGG CTCGTCGCAG
GTTGCCGACA AGAACTGGAA GAAGCGCAAC GTCGGCGGTG ACTTGATCTA TGATGGCAAG
GACGAGGCCG AGGCCCTTCG CGTCGCCCTG CTCAATGCCA TTCGCGACGG AGCCATTCAG
GGCGTGCGCG CGGGCACTGC CGCGCTCCTC AAGAAGGATG GCGACATCGA GACCCAGTTG
AACAAGGCCC TGAAGTTCGA GGGCGTGTTC ACCAGCCTCA AGCAGATGAC GGACCCGGTC
GGGTATGCGC TGCAGCAGCT GACCAAGGAG TTCGAAGGCC TCAAGAAGAT CTTCGACGAG
GCAGGGGCGA CAGCGGCAGA GTACGCGGAC CTCGAGAAGC TCCTCAACCT CCAGAAGCAG
GAAGCCATCG ACAAGGCGGC GGCCGAGGTG CGTCAAAAGG CACTCGAGGC CGTCAACGAT
CCGCTGAAGC TGCAGATCCG CATATTGGAG CTGCTCGGCA AGGGAGAGGA TGCCGTGGCT
GCCGCGCGCA TCATGGAACT CGCCAGCCTC AAGTCGTCTC TTCAGCCCCT TCAAGCGATG
GTCTACCAGC TGGAAGACGC GAAGGCGGTA ATCGATCAGT TCGGGCCACT GGCCGACGAT
TTGCGCGCCT ATCGCAAGGA GCTGCTGGGC GGCAGCACAA CCATGGGGCT TGCTTATCTG
GCGGGCCAGT TCCGCAGCAC TGCGGCCTCG GCCGCCGGCG GCGATGCCAC CGCCCTTGGA
CAGTTGCGCA GCGTTTCTGG TCAATACCTC GATGCAGCCA AGGAGAACGC GGCCAGCGCG
CTCGACTACC AGCGGGCTGT TTCCGAGGTT CTTGCGGCTG TGGACCGCGG GCTCTTCGCT
GCCGACGCCC AGGTCGATTA CGCGCAAGCC CAGATCGACG CGATCGAGAA CACCGCGAAC
ATCATGCAGT CGATGAAGAC CGAGCTGGTG ACGCTGCAGA AGCAGGTCGC TGACACCAGC
GCAACCACGC TCAAGCTCTG GCAGCGGTTC GAGATCAATG GCTTGCCGGT GAACAACGAT
GCCGGGCCGG TCCAGGTGGA GGTAGTCTCG TGA
 
Protein sequence
MAGVKAGTLE IEILTNMARL QEEMRQIEKS VGNMSTGVAR STKAANDNIG SIGKTSGLAK 
HHVQNLAFQF QDLGIQLTAA AQSSKPFQGA MMALFQQGTQ IQGVMSQAGI GVGGLIAQIG
RFVLAAAPAI AAIGAISAAV GLVTSEINEN SKVTVTWQDT LLGTYDALKK YLTDQLTGAF
KAFGLETSNV WKDVVNAAKW GINWIIGAMT VVPRVLRDTW SLIPAGVADI FVTAANGAIR
AVNAMVAKTV EILNGFISSA NVILGKVGLD LPKLAAPQIG ELQNSYAGAG AKLGTAFMGA
IRDTVTRDYL GDAAAAISPF AQARAVERMK KDAKKAGKEA GKALRDGAED ELKKLLASLE
GTFAIYRDAA KTTGKMLDKQ LDRDWQRMFD EMADKQRKAS QAAVDAADAN AAWNDELRRT
IELLDQIGGF ASILANIGAV IEGFSSGDFS GVRGPLGGLL GLVGSTDEGK QAIKDLGVVF
RDALDGVFGG EGSFTKALKA ASVGVAAGQM VFGSKNSGIG SAVGGVLGEV AGKAIGKSVG
GLLGKVGGPL GSIVGGILGG ALGSLFKTVK TGFAVVSNTG VTSGGSSSEL ASSSKSSGTG
IQAAIQNIAD QLGGSVGNYS VSIGKRSSGW ISVSASGSSQ VADKNWKKRN VGGDLIYDGK
DEAEALRVAL LNAIRDGAIQ GVRAGTAALL KKDGDIETQL NKALKFEGVF TSLKQMTDPV
GYALQQLTKE FEGLKKIFDE AGATAAEYAD LEKLLNLQKQ EAIDKAAAEV RQKALEAVND
PLKLQIRILE LLGKGEDAVA AARIMELASL KSSLQPLQAM VYQLEDAKAV IDQFGPLADD
LRAYRKELLG GSTTMGLAYL AGQFRSTAAS AAGGDATALG QLRSVSGQYL DAAKENAASA
LDYQRAVSEV LAAVDRGLFA ADAQVDYAQA QIDAIENTAN IMQSMKTELV TLQKQVADTS
ATTLKLWQRF EINGLPVNND AGPVQVEVVS