Gene Saro_3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3641 
Symbol 
ID5077789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp269328 
End bp270560 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content67% 
IMG OID640481364 
Producthypothetical protein 
Protein accessionYP_001166026 
Protein GI146275866 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.421002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCTT CCGATACTGC AACAGAGCGC CTGCTTTCCG GCCGCGCCTG GGACGATTTC 
TGCGACACGC TGAAGGTGGC GGGGCGGCAC ATCGAAGCGG TGGACGGACC GCTTTCCGAC
CTCGACCGCG CGGAATGGTA CCGCTTCATG ACCCGGCTCG CGCGGTCGAG CATGGAGCGC
CTGCTGGAGA ACGCCGAGCC GACCCGCCCG CGCCTGCGCG ACATGGTGTG GCGCCAGTCG
ATCAACGTCC AGACGGTCGA CCAGGACCAC CTGATGTGCC AGTTCGACGA AGCGCGCGAC
TATCTGATCA CCGGCACGCG CGGCACCATC CCCTATTTCG TGATGGCGCT GTGCACCTGC
CCCGCCCCTG CGGTACCGGG GGCCGAGGAC TGGGCCGGAC AGGGCGTGGA AGGGCTGGCG
CGGTTCGATC CGTCGAACCT CAAGACCACC GGCTTTCTCC ATTCGGGCCA GATGAAGATC
GAGGCCGACG GCAGCTTCGA GGTCGTGCTT TCGCAGAACG ATCCGGGCGA GGGGCGCAAC
TGGTTGAAGC TGACGCCGGA CACGAACTGC ATCCTGATCC GCCTCGTCTG GTCGGACCGC
CTGCGCGAGA CGGCACCAGC CATGAACATC GCGCGGGCCG ACAAGGCGGA GCCGGAACCG
GTCACCCCGG CCCTGATCGC GGACAACCTG GCGTGGACCG CGCAGGCAGT GCTGGGCTAT
GCCGAACTGG TCCGCAACTG GTGGCAGGGC AGCCAGGGCA ACTTCGCCGC GCGGCTCAAC
CGGCTAGACT ACAGCCGCGC ACAGTACCTT TCCAACGGCG GCGTGCCCGA CCGGCACGTG
GCCTTCGGCG GCTGGGAAAA GGGCAAGGAC GAGGCGCTGG TGATCGAGTT CACCCCGCCC
GAGTGCGAAT ACTGGAACTT CCAGCTCTGC AACGTGTGGC AAGAGAACCT CGACACGTTC
GAGGACGGCA ACGGCTGGAT CAACAACTAC CGCCACGTGG CCGAGCGCGA CGGGCGGGTG
CGGGTGGTGA TTGCGGAATC CGATCCCGGC ATCGGCGGCA ACTGGATCAA CAGCTATGGC
CATGAACGCG GCATCTGGGG TCTGCGGCTG GTCCTGACCG AACGGACCGT GCCGGTGAAC
CTGTGGCGCC TGCCGCTGGC GGCACTGGAA GCGCGCGGGC GGGACGCACT CGATCCGGCG
CAGGCGGTTC TGACCGGGCA GTTCGTGGAC TGA
 
Protein sequence
MSASDTATER LLSGRAWDDF CDTLKVAGRH IEAVDGPLSD LDRAEWYRFM TRLARSSMER 
LLENAEPTRP RLRDMVWRQS INVQTVDQDH LMCQFDEARD YLITGTRGTI PYFVMALCTC
PAPAVPGAED WAGQGVEGLA RFDPSNLKTT GFLHSGQMKI EADGSFEVVL SQNDPGEGRN
WLKLTPDTNC ILIRLVWSDR LRETAPAMNI ARADKAEPEP VTPALIADNL AWTAQAVLGY
AELVRNWWQG SQGNFAARLN RLDYSRAQYL SNGGVPDRHV AFGGWEKGKD EALVIEFTPP
ECEYWNFQLC NVWQENLDTF EDGNGWINNY RHVAERDGRV RVVIAESDPG IGGNWINSYG
HERGIWGLRL VLTERTVPVN LWRLPLAALE ARGRDALDPA QAVLTGQFVD