Gene Saro_2988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2988 
Symbol 
ID3917424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3204101 
End bp3206242 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content62% 
IMG OID640445767 
Producthypothetical protein 
Protein accessionYP_498257 
Protein GI87201000 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGTG CGATCTATAA CAACGCAGGA CCAAACGCGC CTTCGGTGGA CCTTATCGCA 
GGCCCCGAGT TCCGCGACGA CTTCAGTACG GATGGCAACC TCAACGGGCG CACCGGATGG
ACCGTGGAAA CGCGCCCGAC CTTCTCCGGA CAGGTCAATG CCATCTCGGC CAGTGGCGGC
AAGGCTGGCG GCACGACCGG CAGCGCGGCC TTTGCCCTGC ACACCGCCAG CGTTGAAACG
GCGCGGGTGC AGTTCAAGAT TGGCGCATAC GGTGCGGCCC TGCATCACCA TGTCAATATC
GCGGCGGTCG ATGCAGAACG AGACTGGCTG AATATCAACG TTGCCACTCA AGTTTCGAGC
GGACTCCAGA CTGGAACGAT CACCTTTGCC AAGGTGGTCA ACGGGGCCTC GGCTTCTAAC
ATCGCGATCC TTCAGCAGTG GCGCACCGAA CTTGGCGACA CGCTCGAAAG CGATTTTACC
GAAGTTGCTG GGGTGAAATA CTGGACCTGC TACCTCAATG GCCAGCAGGC GGGCGCGCCG
GTCGATATCA CCTCGTGGGG CGGTGCGTTC ACAAAGAAGC ACGGCATCAT CGGTAACATC
GCAAGTTCTA CCGACAGCTT CCAGATCGTG GACCCGGCAA CGCAAGTCGC CTTGCGCCTC
TACATGCCGA ACCGCACGAT CTACCGCAAC GCGAACGGCT CGGTTACGTG GTTCGTAAAG
GCGTACTACA CCGGTCCTGA TCCTTCGGGC CTCTATGCCA CGGTCTATGA CCTTTCGTCT
GGTTCGGAAG TCGCGGTATC CGGCCTGACG GACGTTGCCT TGTCGAGCTT CGCGGCGGCA
GGCGGCACGG CAACGGGGAC GCTTTCCGGC ACCTCGTCGC AGGTTTCCAG CGGCGGGCCG
TTCTTCGTTC GTGTCACCCG CAAGCGGCTC TCTGCCCTGA TTGGCAACGC AACCTGTGTT
GTGGACGGGC CCTATCAAAC TGTTGGCGAA ACCGATCTTG CGAATGGCCA GTCGCTCGCA
GTGGCCGCGT TCTCGCAGAC CGCCACCACC GCGACCTATG CGCAGCCGTC GAATTGCTGG
CACCTCGAAG GCGTCCCGAC CAACGGCACG ACCTTGGGGG ATGCATGGAA CCGCCGCCTT
CGCCCGATCA GCGGCAACAC CACAGCCGCC GCGCTGGCCA ACGTGGTCAA GACCGTCTCC
AGTGTAAACT TGACCGTTGC CAGTGGCGGC GTTTCGGGCA CGGCGATTGC CGTTCGCGGG
GCCGGTTCGG TCTGCCAACA GGCGCTCAAG CAGTCCATCG ACCGCGCTGG TGGCAGGGTT
CACTTTGTCC ATCACGTTGA CGGCCAATCG GACGTAACGA CTGCGCAGGC CAGTTACATC
GCCGCAATTC AGGCGATCTA TGACGACCTC GACGCCTACA ACGGCGCAGC GATCAAGGTG
CTGATGCATC CGCTTGCCTC GTGCTGGAAG AACAGCACCG GCAACGACGA GCAGTGGCAG
GCCGTTCGCC GGATGCAATG GCAGCTGACG CAGGATTACC CCTCGCGCTA CTTCTTGGGG
GCCTATACCC TCGACTGCCA GCACTTCGAC AGCCTGCACT TTAACGACGC TGGTTATGCG
ACAATGCTGA CTCGCGCAGG GTGGGCACGG GCCAAGGCTC GCGGTGACGT GGCCAACGAC
CGCAACGGGC CTTCGCTCTA CAGCGTAACC CGCGTCGATG CGCAGACGGT CTATTGCGTT
TACGACCTCA ACGGCGCGGA TAGCCTTGAG CTGGCGAACA CGGCCTACGC CAGCGAATAT
CACGGCGGCA TGTCGTTCTC GACCGCTGCC ACCAAGAGCG GCGGCACCAT CGCGACGAAG
CTCTATCCGA CCGGCGCTAC CGTGGATGCA TCACCGGCAG GCGGGCGGCA GGGCATCACC
TTCACCTTCG CCGCGAACAC GTTCCCCGGC ACGGTCTACG CGTGGGCGGC ATACGGCAAG
AACCCGTTCA ACCCGAACGA CAACAACACC AACACCGACC CGATCAATCT CGACATGGCC
AACAAGGCAT CGATGATCCG GGGTGTCTAT TCGGACGGCA TGAAGGTGGC GTTGCGGCCC
CGGTTCACGA CCGACAGCCT CGATTACCTG AGTGCATCGT GA
 
Protein sequence
MSRAIYNNAG PNAPSVDLIA GPEFRDDFST DGNLNGRTGW TVETRPTFSG QVNAISASGG 
KAGGTTGSAA FALHTASVET ARVQFKIGAY GAALHHHVNI AAVDAERDWL NINVATQVSS
GLQTGTITFA KVVNGASASN IAILQQWRTE LGDTLESDFT EVAGVKYWTC YLNGQQAGAP
VDITSWGGAF TKKHGIIGNI ASSTDSFQIV DPATQVALRL YMPNRTIYRN ANGSVTWFVK
AYYTGPDPSG LYATVYDLSS GSEVAVSGLT DVALSSFAAA GGTATGTLSG TSSQVSSGGP
FFVRVTRKRL SALIGNATCV VDGPYQTVGE TDLANGQSLA VAAFSQTATT ATYAQPSNCW
HLEGVPTNGT TLGDAWNRRL RPISGNTTAA ALANVVKTVS SVNLTVASGG VSGTAIAVRG
AGSVCQQALK QSIDRAGGRV HFVHHVDGQS DVTTAQASYI AAIQAIYDDL DAYNGAAIKV
LMHPLASCWK NSTGNDEQWQ AVRRMQWQLT QDYPSRYFLG AYTLDCQHFD SLHFNDAGYA
TMLTRAGWAR AKARGDVAND RNGPSLYSVT RVDAQTVYCV YDLNGADSLE LANTAYASEY
HGGMSFSTAA TKSGGTIATK LYPTGATVDA SPAGGRQGIT FTFAANTFPG TVYAWAAYGK
NPFNPNDNNT NTDPINLDMA NKASMIRGVY SDGMKVALRP RFTTDSLDYL SAS