Gene Saro_3103 details


Gene Information

Locus tag: Saro_3103
Symbol:
ID: 3918145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3320807
End bp: 3321937
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 66%
IMG OID: 640445887
Product: hypothetical protein
Protein accession: YP_498372
Protein GI: 87201115
COG category: [S] Function unknown
COG ID: [COG3146] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
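
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon (not a residue).
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(3320807, 3321937)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1131 376
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.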


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.73421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCCTTA CAGCGCGCAT CCACAAGGCC GTTTCCGAAA TTCCGGCAGA GGACTGGGAC 
CGCCTTGCCG GACCGGGCAA TCCCTTCGTT TCGCACACTT TTCTGGCGTT GCTGGAAGAG
TCGGGCTCGG TTGGCGGCCG CTCCGGATGG TCGCCCCTGC CGATCGTGAT CGACGACGGG
AATGGGCGAC CGGCGGCGGC CTTGCCTGCC TATCTGAAAA GCCACAGCCA GGGCGAATAC
GTGTTCGACC ATTCGTGGGC GGACGCCTGG CAGAGGGCGG GCGGCAGCTA TTACCCCAAG
CTCCAGATCT GCGCGCCGTT CACCCCGGCC ACGGGGCCGC GCCTGCTGCT TGGCGACCGT
CCCGACCTTG CCGGCCCGCT GCTGCGCGCC GCGGAGCAGT TGTGCGAGGG CAACGAGCTG
TCCTCGGCCC ACGCGACGTT CGTCGAACCG GCGCAGTTGC CGATGTTCGA GGCCGCCGGC
TGGTTGCCGA GAAGCGACAT CCAGTTCCAC TGGGAGAATC GCGGCTATGC CAGCTTTGCC
GATTTTCTCG GCGCGTTGTC TTCGGAGAAG CGCAAGAACC TGCGCAAGGA ACGTGCCCGC
GCCCAGGACG GGGTGGAAAT CCGCCAGCTT ACCGGCGCGG ACATTCGCCC CGAGCATTGG
GATGCCTTCT GGCTGTTCTA TCAGGACACC GGCGCACGCA AGTGGGGACG CCCGTACCTG
ACGCGCCGCG CGTTCGACCT GATTGGCGAG CGGATGGCGG ACAAGGTCCT GCTGGTGCTG
GCGTTTCTCG ATGGCGAGCC GGTGGCGGGG GCGCTCAACT TCATCGGCGC GCAGGCGCTT
TACGGGCGAT ACTGGGGCGC GCTGGTCGAG AAGCCCTTCC TGCATTTCGA GCTTTGCTAT
TACCAGGCCA TCGACGCCGC GATCCGGCTT GGGCTGGATC GGGTGGAGGC GGGTGCGCAA
GGCGGCCACA AGCTGGCGCG GGGCTATGAG CCGGTCAGGA CGTGGTCCGC GCACTTCATC
GCGGACCCCG GATTCCGCCG GGCGGTATCT GATTTTCTGG AACGGGAGCG TGCCGGCATC
GCGCAGGACC AGATGCACCT GGGCGAGCGG ACTCCGTTCC GGAAGGGATA A
 
Protein sequence
MTLTARIHKA VSEIPAEDWD RLAGPGNPFV SHTFLALLEE SGSVGGRSGW SPLPIVIDDG 
NGRPAAALPA YLKSHSQGEY VFDHSWADAW QRAGGSYYPK LQICAPFTPA TGPRLLLGDR
PDLAGPLLRA AEQLCEGNEL SSAHATFVEP AQLPMFEAAG WLPRSDIQFH WENRGYASFA
DFLGALSSEK RKNLRKERAR AQDGVEIRQL TGADIRPEHW DAFWLFYQDT GARKWGRPYL
TRRAFDLIGE RMADKVLLVL AFLDGEPVAG ALNFIGAQAL YGRYWGALVE KPFLHFELCY
YQAIDAAIRL GLDRVEAGAQ GGHKLARGYE PVRTWSAHFI ADPGFRRAVS DFLERERAGI
AQDQMHLGER TPFRKG
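
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is given 5' to 3' on the coding strand, uses the codon assignments of translation table 11 (which match the standard code), and reads the first codon as methionine when it is one of the table 11 initiation codons, which is how GTG at the start of this gene yields M. The helper names are hypothetical. Only the first 60 bases are embedded here; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
FIRST_LINE = "GTGACCCTTA CAGCGCGCAT CCACAAGGCC GTTTCCGAAA TTCCGGCAGA GGACTGGGAC"
print(translate_cds(FIRST_LINE))  # MTLTARIHKAVSEIPAEDWD
```

The printed residues match the first 20 residues of the protein sequence. Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field.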