Gene Saro_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0038 
Symbol 
ID3916041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp39737 
End bp40942 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content68% 
IMG OID640442763 
Producthypothetical protein 
Protein accessionYP_495321 
Protein GI87198064 
COG category[S] Function unknown 
COG ID[COG2733] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATAG TCGCGACCTT CCTGCTCGTG CTGATGGCCG TGCTCTATGC GTTCAGCCGG 
CGGTACGAAG GGCTGCACCC GGCGGTCGGT TTCCTGCGCG CCTTTGCCGA GGCGGCGATG
GTCGGCGGGC TGGCGGACTG GTTTGCGGTG ACTGCCCTGT TCCGCCATCC GCTCGGCCTG
CCCATCCCGC ACACCGCGAT CATCCCGGAG AACAAGGATC GCATTGCCGA TACGATGGCG
GCGTTCCTGC AGACCAACTT CCTGACCCCG CAGGTGGTGG CGCGGCGCAT GGGCGCGGTG
AACTCGGCTG CCGCGATGGG CGCATTTCTG GCCGACCCGC GCGCCGGCGA AAGCCGCCTG
CGCGACGGCG CCGCCGGACT GGTGGCGGAC GTGCTGGAAT CGCTCGATCC CGAGGAGCTT
GGCGGGCTGG CCAAGGGCGC GCTGAAGGCG CAGCTCGAAA GGCTGGAGCT TTCGCCGCTG
CTGGGCCAGT TGCTGGGCGC GGCGATTGCC GACGGGCGGC ACATGCCCGT GATCGAGAGC
CTGATCCGCA AGGCCGCGGA GACGATCGAG GCCAACGAGC CGCTGATACA GGCGACGATC
CACGAGCGTG CCAACACGAT CCTGCGCTGG ACCGGCCTCG ACGAGAAGCT CGCCAACGCA
ATCCTCGACG GCCTCTACAA GCTTCTGGCC GAGACACTGG TGGTGCCCGA CCATCCCGTG
CGTCGGAAGA TCGAGGACGG CCTTGCCGCA TTGGCGCACG ATCTGGTCCA CGATGCCGAG
ATGCGCGCGC GGGTCGAACG GATGAAGACC GAAGTCCTCG CCAATCCTGC CTTTGCCCGT
TGGCTCGACG CGCTGTGGGA GCGCGGTCGA ACCCGGCTCC TGCAGATCGT CCGCAATCCC
GAGGGCGCGC TTGGCGGACA GCTCGGGGCC AGCCTTGCCG AGCTGGGCCT TGCCCTTCAG
CGTGACGAAC GGCTGCAGCG GGTGGTCAAC CGCTTTGCCC GCAGGACGCT GGTCGGCGTC
TCGACCCGCT ATGGCGCGCA GATCGTGCGG CTGGTGTCGG AAACGGTGAA GCGCTGGGAT
GCGCGGACCG TGACCGACCG CATCGAAGGC GCGGTGGGCC GCGACCTACA GTTCATCCGC
ATCAACGGCA CGTTGGTCGG CGGGCTGGTC GGACTGCTGC TCCATGCCGT GGACCTTGCC
CTGTGA
 
Protein sequence
MRIVATFLLV LMAVLYAFSR RYEGLHPAVG FLRAFAEAAM VGGLADWFAV TALFRHPLGL 
PIPHTAIIPE NKDRIADTMA AFLQTNFLTP QVVARRMGAV NSAAAMGAFL ADPRAGESRL
RDGAAGLVAD VLESLDPEEL GGLAKGALKA QLERLELSPL LGQLLGAAIA DGRHMPVIES
LIRKAAETIE ANEPLIQATI HERANTILRW TGLDEKLANA ILDGLYKLLA ETLVVPDHPV
RRKIEDGLAA LAHDLVHDAE MRARVERMKT EVLANPAFAR WLDALWERGR TRLLQIVRNP
EGALGGQLGA SLAELGLALQ RDERLQRVVN RFARRTLVGV STRYGAQIVR LVSETVKRWD
ARTVTDRIEG AVGRDLQFIR INGTLVGGLV GLLLHAVDLA L