Gene Information |
Locus tag | Saro_2718 |
Symbol | |
ID | 3918493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2949732 |
End bp | 2951636 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640445496 |
Product | hypothetical protein |
Protein accession | YP_497988 |
Protein GI | 87200731 |
COG category | [S] Function unknown |
COG ID | [COG3567] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01555] phage-related protein, HI1409 family |
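The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Saro_2718.
length = gene_length_bp(2949732, 2951636)
print(length, protein_length_aa(length))  # prints: 1905 634
```

Both results match the Gene Length (1905 bp) and Protein Length (634 aa) fields listed in the record.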
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCACCG TGACCAACAT CCTCGACAGC CTCCGCAATG CGCTCACAGG CCAAGGCACG TCGCGCGATC CCCGCACCGC ATCGGCGTAC TGCGCGACCC GCGCCCTCAC CCAGCACGAA ATCCATGCCG CCTACTCCGG CTCTGGCCTG CTCAAGAAGA TCATCCAGAT CCCACCGCTC GACATGGTGC GGGAATGGCG CGACTGGTCC GGCCTCGACG ATGACCAGGC CGCGCTGATC TGGGACGAGG AAAAGCGGCT CGGGCTGCGC GAGAAGGCGA AGCTTGCCGA GACCCTGCGC GGACTTGGCG GCGGCGCGTT CATCCTCGGC CTGCCTGGTC AGCCTGCCAC GCCCGCACCG AAAGCGGTGG CCAAAGGTGG TCTCGCCTAC ATCAACGTCG TCTCGCGCTG GCACCTGACG TTCGATGCGC TGCAGGACGA TGCCCGCCTT CCCGGCTACG GCGAGCCGCA GATGTGGCGC ATGCAGACCG CAACCGGTCA GCAGCTCATT CATCCCTCGC GCGTCGTGAC GTTCCGTGCC GACACCAGCG GTTCCCTGAT TGCGAGCATG GCCAGCCAGG ACGACGCCTA CTGGGGCGAA AGCAGGCTGG AGCAGGTCCT TGAAGCAGTG AAGGACAGCG ACACCGCGCG CGCGTCGTTC GCAGCGCTGC TTCACAAAGC CCGCCTAACC CGCATCGGTA TTCCGAACCT GTCGGACATC GTGTCGACCA GCGACGGAGA GAGCCGCATT GGCGCCCGCC TCGGCATGAT CGCGCTGGCG GAAAGCATGT ACAACGCGGC GGTCTACGAC AGCGGAAATG GCGCCGACGG CCCGGCGGAG AAGATCGACG ACGTTGCCTA CAACTTCGCC GGGGCCAAGG ACGTTCTCAA CGCCTTCGCC GAGTTCGCCG CCGCGATTTC CGACATCCCG GCGACGCGTC TGCTCGGCCG CGCCCCAGAA GGCATGAACT CGTCCGGCGA CAGTCAGCAA AAGGACTGGT CGAAAAAGGT CCGCGCGATG CAGACCCTCG AACTCGGGCC CTGTCTCGAC CGCGTCGACG CTTACCTCGT GCCCTCCGCG CTGGGGCGCG CCGAACCGAA CGCCAGCTAC GCATTTGCCC CGCTCGATGT CGAGACCGAC AAGGAGCGCG CCGATCGCTT CGCCAAGCAG ATGGAAGCGG CCGAGAAGCT GGCCGGCCTC AACGCAATGC CGGAGCAGGC ATTCAACCGC GGCATCCAAT CGCTGATGAT CTCTGAGGGC TACCTCCCCG AACTTGAGGC GGCGCTGTCC GATATCCCCG ACGACGAGCG CTATGGCATC GTTGCCGATC CCTCGCCCGA GGACATGAAT CCCGATGGGA CGAAAGGAGG TGATCCGGCT ATCTCGGCTC CGGGCGGGAC CTCGACCACC CCGCCTGTGG CTGCCAACGA CGCCAGCCCG CGACCGCTGT ATGTCCATCG CAAGTTGCTG AACGGCGGTG AGCTGATCGA CTGGGCGAAA GCACAAGGCT TCGACGTGAC GGTCCCTGCC GACCAGCTCC ATGTCACCGT GCTATATTCT CGTGCCGCTG TCGATCCGAT GGCGATGGGC GAAGGATGGT CGAGCGATCC CGATGGCGGT CTGGTGATCA AGGCTGGCGG TCCGCGCGCG CTCGAACGGT TCGGAGAAGG CGCCGTCGTC CTCCAGTTCG CGTCGTGGTC GCTGCAGTCG CGCCACGACG AGATGGTGCG CGCCGGCGCA AGTCACGACT ATCCGGAATA CCTGCCGCAC GTGACGCTGA CCTATCAGGC ACCGGAAGGC 
ATCGACCTCG AAGCGATCAA GCCGTTTTCC GGAGAGTTGC GCTTCGGGCC TGAGGTGTTC GAGCCGCTGG ATCTGGACTG GAAGTCGAAG ATTACGGAGG AGTGA
|
Protein sequence | MGTVTNILDS LRNALTGQGT SRDPRTASAY CATRALTQHE IHAAYSGSGL LKKIIQIPPL DMVREWRDWS GLDDDQAALI WDEEKRLGLR EKAKLAETLR GLGGGAFILG LPGQPATPAP KAVAKGGLAY INVVSRWHLT FDALQDDARL PGYGEPQMWR MQTATGQQLI HPSRVVTFRA DTSGSLIASM ASQDDAYWGE SRLEQVLEAV KDSDTARASF AALLHKARLT RIGIPNLSDI VSTSDGESRI GARLGMIALA ESMYNAAVYD SGNGADGPAE KIDDVAYNFA GAKDVLNAFA EFAAAISDIP ATRLLGRAPE GMNSSGDSQQ KDWSKKVRAM QTLELGPCLD RVDAYLVPSA LGRAEPNASY AFAPLDVETD KERADRFAKQ MEAAEKLAGL NAMPEQAFNR GIQSLMISEG YLPELEAALS DIPDDERYGI VADPSPEDMN PDGTKGGDPA ISAPGGTSTT PPVAANDASP RPLYVHRKLL NGGELIDWAK AQGFDVTVPA DQLHVTVLYS RAAVDPMAMG EGWSSDPDGG LVIKAGGPRA LERFGEGAVV LQFASWSLQS RHDEMVRAGA SHDYPEYLPH VTLTYQAPEG IDLEAIKPFS GELRFGPEVF EPLDLDWKSK ITEE
|
| |