Gene Saro_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3026 
Symbol 
ID3916637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3235845 
End bp3237056 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content59% 
IMG OID640445805 
Producthypothetical protein 
Protein accessionYP_498295 
Protein GI87201038 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.85184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAGAA CGAAGCGCTC GAAGCCGCTC TATCAGCGCG GCCCCTTCGC CCTCTATCGC 
CGCCCCGATC GCACGAACCT CGAAATCGTC TGGTACGATG CCGAGCGCAA GCGTGAACGA
TGCATTAGCG CGGGCACAAG CGATGTTGGG GAAGGCAGCA AGGCCCTAGA TCGCTGCTAC
CTCAAGGAGC AGGGAAACCA CGTCTGCCCC AACTGCGGCA GAGCGATGGA AGGCGAAGCT
GCGCCCTTAC TGGCAAGTGC CATCACTGAC TATCTGACCA TTTCCGAAGG AAAGGCTGGA
TATAAGGCAT CTAAGACCCG GCTTTCACAT GTCGTTGGAT ACCTTGCAGC AACCAATCCC
GCCGTCACCC TACCCATGAT TTCGCGGGAT TGGGTAGAAG GTTTCCGGAA GTGGATGCGC
GGTAAAGATT ACGCCCCCGG CCATATCGAG GGATGCGTTC TGCAACTCGC CGCAGCGATC
AATTCGGTCC ATGGCCATCA GGCGCAATTC AAAGCCCGCT CCGTAAAGGA CGCGGCCCGC
TCTCCAGTTT ATCGGGCAAG CGTTGAGGAG CTTGCGGCCA TGTTCCGCTT CTGTATCGAT
CCGCCCGCGC CCAAAGGTAG GCAGTGGAGC GACAAGGAGC GCGCCATGGT CATCGCCACC
CGCGAGAACC TTTTGCGGTA TCTGCGAGCT GCTGTCGCAA CATGGGCGCG GCCAGACGCG
ATCTTTGATC TCAAGGCCAA GGGGCAATGG CATAGCGCGG CAGGGGTTCT TGATCTGAAC
CAGCCAGGCC GACCGCAAAC CAAAAAGTAC CGCCCCATCA TACCAGTCGC GCGGCAGTTT
CGGCCTTGGC TTGATGAAGC GCTTGCCCGC GAAAGCTACA TCCCCGTCAG CACTGTGCGT
CATGGATGGG CGTCAATGCG GATGCACCTT AAGTTACCGA CAGGGCGCGA GGCAGGCGAA
AAGCTTATCC GCAGGAGCAT GGCAACCATA TGTCGCAAGC TCATCGGAGA GGCGAATTGG
GCGCAGGGCG AAATGATGCT CGGGCACCGG AAATCGAGCA TTTCCGACAT TTACGCCATT
GTTGATCCCG CGAATCTCGG CCTCGCTCTG GAGGCCACAG AAACGGTTAT TGACCGTATC
GAAGCCTTGA CGCCGGGTGC GTTTTGCCGC ACTCTTACCG CAGAAGCCTC CCCGCTTCGA
GTTGTGAAAT GA
 
Protein sequence
MPRTKRSKPL YQRGPFALYR RPDRTNLEIV WYDAERKRER CISAGTSDVG EGSKALDRCY 
LKEQGNHVCP NCGRAMEGEA APLLASAITD YLTISEGKAG YKASKTRLSH VVGYLAATNP
AVTLPMISRD WVEGFRKWMR GKDYAPGHIE GCVLQLAAAI NSVHGHQAQF KARSVKDAAR
SPVYRASVEE LAAMFRFCID PPAPKGRQWS DKERAMVIAT RENLLRYLRA AVATWARPDA
IFDLKAKGQW HSAAGVLDLN QPGRPQTKKY RPIIPVARQF RPWLDEALAR ESYIPVSTVR
HGWASMRMHL KLPTGREAGE KLIRRSMATI CRKLIGEANW AQGEMMLGHR KSSISDIYAI
VDPANLGLAL EATETVIDRI EALTPGAFCR TLTAEASPLR VVK