Gene Information |
Locus tag | Saro_2328 |
Symbol | |
ID | 3915673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2473346 |
End bp | 2474914 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445084 |
Product | hypothetical protein |
Protein accession | YP_497599 |
Protein GI | 87200342 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase); [TIGR02914] EpsI family protein; [TIGR03109] exosortase 1 |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.338173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGCCGC CTGATCTCGC GCTCAGGGCC CCGCTCGGCC GCGCCTGGAC CGCGCTGCCC GCCCAGTGGC GGCGTGCGCT CGGCCTGCTC GCGCTGGCGT GGCTCGGCAA CGTATTGCTC TTTGCGGCCG ACTGGCGCGC GATGTTCCTG CAATGGTGGG ACAGCTCGAC ATACAATCAC GTCCTGCTGA TCCCCTTCAT CCTCGGCTGG CTCGTCTCGC TGCGCTGGCG CGAGGTGGTG AAGGTCGCGC CGCAGGGCTG GTGGCCAGGG CTGGTCCTCT TCGCGGGAGC CGGCTTCCTC TGGCTGCTTG GCGATTTCGC GGGCCTTTCG CTTGCCACGC AGCTTGGCGT GGTGCTGATG GCGCAGGCGA GCGCGCTGAC GCTGCTGGGG CCGCGCGTTT CCGCCGCGCT GGCGTTCCCG CTCGCCTACA TGCTTTTCCT CGTCCCCGCC GGCGACGAGC TGATCCCCAC GCTGCAGACG ATCACTGCGC GAATCACCAT GGCGCTGCTC GACCTCAGCC AGGTCCCGGC GCACATCGAG GGCGTGTTCA TCACCACCCC CGGCGGCTAT TTCGAGGTCG CGGAGGCGTG CTCCGGTGTG AAGTTCCTGA TCGCGATGGT CGCCTATGGC GCGCTGGTCG CGAACGTCTG CTTTGCCACC TGGACCCGCC GTGCGGCGTT CATGGCGCTG AGCGTGGCCA TGCCGATCCT CGCCAACGGC GTGCGCGCAT GGGGCACGAT CTTCATCGCC GAACACCACG GCATCGAATT CGCGGCGGGC TTCGACCACG TGTTCTACGG GTGGATATTC TTCGCCATGG TCATGGCGAT GGTCATGGTC CTCGCATGGC GCTTCTTCGA CCGCGCGATC GACGACCGCA TGATCGATCC CGACGCCATC GCCGCAAGCC CTGTCCTTGG CCGCCTGTCC GGCTTTGCCA TGGCGCAACC CCGGGCCCTT GCGGCCGCAG CCGCCATCGC CCTGCTGTTC GCAGGCTGGG GCGCCTGGGC CAACAGCCTC GAGGCCGCGA TCCCCGCTCG CATAGATTTC CCCGGTGTGC CGGGCTGGCA GCAAGTCGAC TATGCCCCCC AATCGCACTG GAAACCGCTC CACGGAGGCG CTTCCCACGT CCTCCTGGGC CGCTTCCGCG ACGGTGCGGG CCACACCGTC GACGTTTCCT ACGCGCTCTA CGCCATGCAG GCAGACGGGC ACGAGGCGGG AGGTTTCGGG CAGGGAGCCA TCCCGCTTGG CGGTGGCTGG GCCTGGGAGC GCTCTGCCGC CCCGCTCGCG GGAGGCCACG CCGACCGCAT CCAGACAGCC GGTCCCGTCC ATCGCCTGGC AGAGACCTTC TACCGCTCGG GCAGCCTCTT CACCGGTAGC AACACCCGCC TGAAACTGCG GAATATCCTC GACCGTCTGC TGTTGCGGGA GCGGACCACG GCCACGCTCA TCCTCTCCGC CGAAGACGAC ATTCCCGGCC AGCCGTCGGC CGAACAGTCC ATGCGCGCCT TCCTGTCGGC CATCGGTCCG GTCGATGCGT GGATGGACCG CGCTGCCTTG CCCCGCTAG
|
Protein sequence | MSPPDLALRA PLGRAWTALP AQWRRALGLL ALAWLGNVLL FAADWRAMFL QWWDSSTYNH VLLIPFILGW LVSLRWREVV KVAPQGWWPG LVLFAGAGFL WLLGDFAGLS LATQLGVVLM AQASALTLLG PRVSAALAFP LAYMLFLVPA GDELIPTLQT ITARITMALL DLSQVPAHIE GVFITTPGGY FEVAEACSGV KFLIAMVAYG ALVANVCFAT WTRRAAFMAL SVAMPILANG VRAWGTIFIA EHHGIEFAAG FDHVFYGWIF FAMVMAMVMV LAWRFFDRAI DDRMIDPDAI AASPVLGRLS GFAMAQPRAL AAAAAIALLF AGWGAWANSL EAAIPARIDF PGVPGWQQVD YAPQSHWKPL HGGASHVLLG RFRDGAGHTV DVSYALYAMQ ADGHEAGGFG QGAIPLGGGW AWERSAAPLA GGHADRIQTA GPVHRLAETF YRSGSLFTGS NTRLKLRNIL DRLLLRERTT ATLILSAEDD IPGQPSAEQS MRAFLSAIGP VDAWMDRAAL PR
|
| |
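The length fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon (the listed gene sequence ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information table for Saro_2328.
length = gene_length(2473346, 2474914)
print(length)                  # 1569, matches "Gene Length"
print(protein_length(length))  # 522, matches "Protein Length"
```

Passing the full Gene sequence string to `gc_percent` should give a value that can be compared with the rounded GC content field; the function strips the spaces that separate the 10-base blocks.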