Gene Saro_3919 details

Gene Information

Locus tag: Saro_3919
Symbol:
ID: 5077403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 86362
End bp: 87666
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 58%
IMG OID: 640481026
Product: plasmid encoded RepA protein
Protein accession: YP_001165688
Protein GI: 146275527
COG category:
COG ID:
TIGRFAM ID:
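
The start, end, gene length and protein length fields above are consistent with one another. A minimal sketch of the arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon. The helper name cds_lengths is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(86362, 87666)
print(gene_len, protein_len)  # 1305 434
```

The result agrees with the listed values of 1305 bp and 434 aa.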


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGAG ACACACTGCG TCCAATCGGA CATCAATACG CTCTAGCAAT GCTAGGCGGT 
GGTGAAGAGC GAGTCAGGTC ACTAGCGCAG TCTGCCGGCA CTCAGCTGAC CATGGATGCG
TTTCTCAGGG TTCAGGATGA AGAGCCCGTA CCGGCATTTC TGCATTCAGC GCTTTGCGCG
ATGTCCCTGC CCACGAAGCG GCCGAAGGAT GATACGCAAC CCATCCTTCG CGAAGACGGA
AAGTATGCGC TGGCGATCAA CCCCAGGCCC ATATTGCAAA CCGTCGATGG CAAGCCGACA
CTTCGAAGCC TCGGAGTACC GTATGGGGCC TATCCGCGCG TCGCGCTGAT CTATCTGCTG
TCGCAAGCAG TCACGAAGCG TTCGCGCGAC GTCTACTTGG GTCGCAATTT CACTGAGTGG
ATGCGCCGTC TTGGCTATCA GACAGTTTCC TATGGACCTC GCGGTACCGC CAATTTGATG
AGGGAGCAGG TGGACCGGCT GCTTGCCTGC GAATGGCAAA TCCGCTGGGA GGGTAACGAG
GGTGGGGACA ACGCATTCGC TGTTCGGGAT GTGAAGATTT CCAACGAGTA CGCCGGATCG
CTTGAGAAAA ACGGCGCATT TGCGCGTGAA ATTCGGATGT CAGAGGCATT CTACAGCCAC
CTGCTTGATC ATGCCGTACC GCTTAACGAG GTCGCTATTC GAGAGCTCAA GGGCACCCCA
ACTGCGCTCG ACCTCTATAC CTACCTTGCG TATCGACTGC CACGGATCGG CAGTGACCGG
GGGCAAGTAA TCTCCTGGGA TCAACTGGCC AAGCACTTGG GCAATGACGC CGACAGCAAG
CGTTTCCGGC AAACCGTGCG AGAGACCATG CAGTTGGTTT CGGCGGTGTA TCCCAACGCA
GATGTCGATT TCAGCGGTCG CAAGGTGGTG TTGCGACCTT CGCCAGCCCC ATTGGAGCGA
AAGCTCGTCG GTCCGCACCT GCGTGTCATT GGTGCACCAG CGTTGGAAAC CGCACCGAGA
TCATCGGTCC CGAAGATGGC TCGCACGCCC CTTCGCGAAA CGAAGACCAC CGAACCTCTG
CAGCATTTTC CGGGCGGTAG CCTGACATAC GGCGACCGAG AGACGAAGTT TCGGGCGATC
GGTCTCGATA AAGGTAAGCC GTGGTGTGTT GATACCATGG CAAACGCTTT TCGTGCGGGC
TTCCCTGGCA TCAAGCAAGC GCGCACTGAT GCCGAGTGGC TCAGGGTCTG GGAGGCCTTC
GTTATCAAAT ATGCTGACCG GCGCGCTCAG GCAGGCGCAA ACTGA
 
Protein sequence
MSGDTLRPIG HQYALAMLGG GEERVRSLAQ SAGTQLTMDA FLRVQDEEPV PAFLHSALCA 
MSLPTKRPKD DTQPILREDG KYALAINPRP ILQTVDGKPT LRSLGVPYGA YPRVALIYLL
SQAVTKRSRD VYLGRNFTEW MRRLGYQTVS YGPRGTANLM REQVDRLLAC EWQIRWEGNE
GGDNAFAVRD VKISNEYAGS LEKNGAFARE IRMSEAFYSH LLDHAVPLNE VAIRELKGTP
TALDLYTYLA YRLPRIGSDR GQVISWDQLA KHLGNDADSK RFRQTVRETM QLVSAVYPNA
DVDFSGRKVV LRPSPAPLER KLVGPHLRVI GAPALETAPR SSVPKMARTP LRETKTTEPL
QHFPGGSLTY GDRETKFRAI GLDKGKPWCV DTMANAFRAG FPGIKQARTD AEWLRVWEAF
VIKYADRRAQ AGAN
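
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that translation follows. It assumes the NCBI codon ordering (bases in TCAG order) and uses the amino-acid assignments that tables 1 and 11 share. The names translate and gc_fraction are hypothetical helpers, and alternative start codons are not handled. Only short fragments copied from the record are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation tables 1 and 11 share these assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base groups."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAGCGGAG ACACACTGCG TCCAATCGGA"))  # MSGDTLRPIG
print(translate("GCAGGCGCAA ACTGA"))                  # AGAN
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Passing the full gene sequence to gc_fraction gives a value to compare with the GC content listed in the record.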