Gene Saro_3589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3589 
Symbol 
ID5077738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp209439 
End bp210644 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content66% 
IMG OID640481313 
Productplasmid replication initiator protein-like protein 
Protein accessionYP_001165975 
Protein GI146275815 
COG category[L] Replication, recombination and repair 
COG ID[COG5534] Plasmid replication initiator protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCA ACCGCCCTGC CCCCTCCGGC GACCAGTTCG ACCTGTTCCT GCCATACGTG 
GCGGACATGC CGATGCGCGA CCAGCGCGAG ATGATGGAAC GTCCGTTCTT CAGCCTCGCC
AAGTCGAAGC GCGTGAAGCC GATCGACTAC ACCTCCCCTG ACGGCAAGCT GTGGGTGCAC
GTATCGGGTA ACCCCGACTA TGGGATGGCG ACGATCTGGG ACGCCGACAT CCTGATCTAT
TGCGCGAGCG TGCTGGCCGA CATGGCCCGG CGCGGGGTAA ACGACGTGCC GCGCAAGCTG
CACCTCATGC CCTACGACCT GCTGCGCGCA ATCGGCCGGC CGACGACGGG GCGCGCCTAC
GAATTGCTCG GCCAGGCGCT CGACCGCCTT GTCGCCACCA CGATCAAGAC CAACATCCGC
GCAGAGAACC GGCGCGAGGC CACATTCTCG TGGCTCGATG GCTGGACCCA GCTTGTCGAT
GAAAAGACCG AGCGTTCGCG CGGGATGACG ATCGAGCTGT CCAACTGGTT CTGGGAAGGC
GTGATGATGA AGGGCGGGGT GCTCTCCATC GACCGCGCCT ACTTCGACAT TACCGGCGGC
CGCGAACGCT GGCTCTACAA GGTCGCGCGC AAGCACGCCG GCGGGGCAGG GGAGGAGGGC
TTCGCGATCT CGATGCCGGT GCTCTTCGAG AAATCGGGCG CGGAAGGCGA GTACCGCCGC
TTCAAGTTCG AGATCCTGAA GCTGGCCGAA AAGAACGCGC TGCCGGGCTA TGGCCTGTCG
GTCGAAACCG CCAGGGGAGG CGAACCCATG CTGCGCATGC GGCGGGTCGA CGGCAAGGAC
GGCGCGGACC GGGCATTGCC CGAAGCGGGA CGACAAGGAG CCGAAGCCCG TACCGTGGCG
AGCACGGCAC CCGATGTTTC CCCGGGGAAA CATTCTTCCG GAGCTAGCGA AACGGTCGAC
GTGCGCGCGC TGATCCGCAA GACCGTGGCT GGCGTCAGCG ACGCCGCGAC GCGGGGCTTC
ATGACCGACG AGACGATCCG GCACTTGCGC GAAACCTGCC CGGGCTGGGA TCTCCATGCG
CTGCACGCCG AGTTCGAAAG CTGGGTGAAC GGCGACTCTG CACGGCTTCC GGCTAACTGG
CAGAAGGCCT TCATCGGCTG GGTGAAGCGC CACCACGAAA AGAACGGCCA CGCGCTGCGG
CGCTGA
 
Protein sequence
MSRNRPAPSG DQFDLFLPYV ADMPMRDQRE MMERPFFSLA KSKRVKPIDY TSPDGKLWVH 
VSGNPDYGMA TIWDADILIY CASVLADMAR RGVNDVPRKL HLMPYDLLRA IGRPTTGRAY
ELLGQALDRL VATTIKTNIR AENRREATFS WLDGWTQLVD EKTERSRGMT IELSNWFWEG
VMMKGGVLSI DRAYFDITGG RERWLYKVAR KHAGGAGEEG FAISMPVLFE KSGAEGEYRR
FKFEILKLAE KNALPGYGLS VETARGGEPM LRMRRVDGKD GADRALPEAG RQGAEARTVA
STAPDVSPGK HSSGASETVD VRALIRKTVA GVSDAATRGF MTDETIRHLR ETCPGWDLHA
LHAEFESWVN GDSARLPANW QKAFIGWVKR HHEKNGHALR R