Gene Saro_1822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1822 
Symbol 
ID3918381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1922605 
End bp1923888 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content69% 
IMG OID640444563 
Producthypothetical protein 
Protein accessionYP_497096 
Protein GI87199839 
COG category[S] Function unknown 
COG ID[COG3182] Uncharacterized iron-regulated membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.302713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGA GCGCGCCCCG CTCGTTCTTT CTGCCCCGGC CGCTCGCGCT GCCGGGGCGG 
GGTTTCTGGG TGATCGCCCA CCGCTGGGCG GGCCTTACCC TTGCGCTGTT CCTTGGCGTG
GCCGGGCTCA CCGGCTCGCT CCTGCCGTGG ATCGAGGAGC TGGAAGCGGC CACCGCGCCG
CAGCTCCACA ATTCGGTCTG GACCGGCACG CCCGATCCCC TGCGCGTGCG CGAGGAAGTG
CTGGCCCGCC ACCCCGGCGC GGCCGTCGAT TTCCTCCCCC TCACCGTGGA GCCAGGCAAG
TCCCTGCGCC TCCACCTCCA CTGGCTCGAC CCGAAAACCG GGCTGGAGCG CGAACGCGGC
CCCGGGGTGC CCGACTGGAA CGACCTGTTC CTGAATCCCG TCTCAGGTGA AGAGCAGGGC
CGCCGCGAAT GGGGCAATAT CGGGCAAGGC CTCAAGAACC TCATGCCCTT CCTCTACCGC
CTGCACTATA GCCTCGCGCT TGGCGCGATC GGCACGCTGG TCTTCGGGGT GGCGGCGCTG
ATCTGGACCG TGGACTGCTT CGTCGGATTC TACCTGACCC TGCCCCCGCG CGCGCCAAGG
TCCGCCCGAG CCCCTTTCCT CGAACGCTGG CGCCCCAGCT GGCGCGTGCG GTGGAAGTCC
ACGCCCTACA AGCTCAACTT CGACCTCCAC CGCGCCGGGG GCCTGTGGCT CTGGCCGCTG
CTGCTGGTCT TTGCGTGGTC GAGCGTCTCG TTCAACCTGC CCCAGGTCCA CGTACCGATA
ATGCAGGCGG TGGGCGCGCA GGACGCGCGT CTCGTGCTGC TGGAAAGCAC GCTACCCGCC
CCCCGCAACG CCCCGCGGCT GGGCTTCCGG GAAGCCGTTG AACGGGGGCA GGAGCTTGCC
GAACAGGAGG CGACGAAGCA GGGTCTTGCC GTCCTCGATG AAGGCGAGAG CTGGATCTGG
CACGTGCCCA CCAGCGGCCT CTACGCCTAC GGCTTCACCA CCGGGGCCGA CATCAGCCAC
CACGGCGGCG GCACCCGCGT CGCCTTCGAC AGCAACACCG GCGTACTGAA GTCAGTGGAC
TGGCCGAGCG GCGTCAACGG CGCCAACACC TTCACCAACT GGCTGACTGC GCTGCACACC
GCCCATGTCT TCGGCCTGCC CTACCGCCTG TTCGTCAGCG CGCTCGGCCT GATGGTCACC
ATGCTTTCGA TCACCGGCGT GGTGATCTGG CTGAAGAAGC GCTCCGCCCG CGCCGGCCGC
GCAATCCGCC AGCCCAAAAC ATGA
 
Protein sequence
MATSAPRSFF LPRPLALPGR GFWVIAHRWA GLTLALFLGV AGLTGSLLPW IEELEAATAP 
QLHNSVWTGT PDPLRVREEV LARHPGAAVD FLPLTVEPGK SLRLHLHWLD PKTGLERERG
PGVPDWNDLF LNPVSGEEQG RREWGNIGQG LKNLMPFLYR LHYSLALGAI GTLVFGVAAL
IWTVDCFVGF YLTLPPRAPR SARAPFLERW RPSWRVRWKS TPYKLNFDLH RAGGLWLWPL
LLVFAWSSVS FNLPQVHVPI MQAVGAQDAR LVLLESTLPA PRNAPRLGFR EAVERGQELA
EQEATKQGLA VLDEGESWIW HVPTSGLYAY GFTTGADISH HGGGTRVAFD SNTGVLKSVD
WPSGVNGANT FTNWLTALHT AHVFGLPYRL FVSALGLMVT MLSITGVVIW LKKRSARAGR
AIRQPKT