Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2791 |
Symbol | |
ID | 3916951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3012112 |
End bp | 3014172 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640445570 |
Product | cytochrome c biogenesis protein, transmembrane region |
Protein accession | YP_498061 |
Protein GI | 87200804 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATAC GCGAATTGGT CGGGATCAAG ACCCGCCTCG ATCCTCTCGG GCGCATCATA GCGATGCTTG TGACAGCGAT GTGTCTGTCG CTTCTCGCGT GGCTTTCCCC CGTCCATGCC GCGCCCAACC GCATGACCGC ATCGCTGGTC GCGGAGGGCC CGGTCCGCCC CGGAGAAACG GTCACGCTCG CCCTGCTCAT GCAGCCCGAA AAGGGGTGGC ACGGCTACTG GTCCAACCCG GGCGACGCCG GCTTCGGCCT GACCGTCGAG TGGACCCTGC CGGCGGGCGT GACCGCAGGC GATATGCAGT TCCCTGTCCC GCAGACGCTC GTCGTTCAAG GGCTGATGAA CCACGTCTAC GAGCATGACT ATGCTGTCCT CGTGCCGCTC AAGGTGCCTT CCAGCGCCCG TTCGGGAACG CTCCTGCCAA TCAGGGTCAA GGCCCAGTGG CTCGTCTGCA CCGAAACGAT CTGCGTGCCC GAACGCGCCG AATTGCAAGG CGCGGTTAGC GTCGGCACCG GCCCCGCCGA CCCGCGCTTC GAAGGATGGC GCGCCGCCAT GCCAGCCCCG CTCGATCGTC CGGGCGCATT TGCGCTCGAA AGCGGCACGC TGCGCCTCGC CCTCCCGTTC CCGGCCACCG CGCCGCTCGA TGCGCCGCAC CTTTTCCTCA AGACCGAACG CGTGGTCGAT TACGCCGCGC CGCAACGCTT CTTCCGCCAG GGCGACACAC TTGTCGTCGA AGTTCCGCTG GCCAAGTCGC CCGCCACCGC CTCAAGGCTC GAAGGCGTCC TGGCGATGGG CAGGGACGCG GGCGGCATTG CCTTCGCCGC CGAGCCCGGC ACCGTGCCGA AGGGCGGCAC GCAAGTAGGC GCTGGCGCCT TCTCCGCCTC GGTCCTCGCG CTTGCTCTTG CCGGAGCGTT CCTCGGCGGG CTGATCCTCA ACGTGATGCC CTGCGTCTTC CCCATACTCA GCCTCAAGGC CCTCGCTCTG GCCCGGGGCA ACGCGCATGA AGCCCACAAG GAGGGCCTGG CCTATACCGC CGGCGTCGTC CTCGCGTGCC TCCTTCTGGG CGGCGTGATG CTGGCCCTGC GCGCAGGCGG AGAACAGGTC GGATGGGCGT TCCAGTTGCA GGAACCGGGC GTCGTCGCCG TCCTCATGCT CCTGGCGGTT GCGATCACGG CAAACTTCGC GGGGCTCTAC GAACTTCCCG GTCTTTCGGT CGAGCGCGGT GCGGGCGCGA CCGGGGCGTT CGGCACGGGC CTGCTTGCCG CCTTCGTCGC CACGCCCTGC ACCGGGCCGT TCATGGCCGC CGCCATGGGC GCGGCGCTCG TCCTGCCGGC GTGGGCAGCG CTAGGCGTCT TCGCGGCGCT GGGGCTAGGC CTCGCCCTGC CGTTCCTGCT CCTGGGCTTC GTGCCGGCCC TGCGCCGCCT TCTGCCCAAG CCCGGCAAGT GGATGGAGCG CTTCCGCCGC TGGATGGCGC TGCCGATGGG CCTCACCGCG CTGGCGCTGG GCTGGCTGGC ATGGCGCGTG GGCGGGGCGA CCTTCACCGT CGTTCTGGTT GCAATCGCCG TGGTGTTGCT CGGCGCACTT GCGGCTGTCG GGGTGACGCA GCGCAAGGGC CTGCCGATCG GCAGGCTCAT GGCCCTGGCT GCCCTGATCG CCGTCGGCGG CGTGATCGGC CTGCCCCCGC CCGCGCAGAC CGCCGCCAGC GAAGGTGGCC TGCTCGAGGC CCGGCCCTTC TCGGAAGCCG CGCTGGCCGA GGCCCGCAAG GCAGGAAAGC CCGTCTTCGC CTACTTCACC GCCGACTGGT GCCTGTCCTG CAAGGTCAAC GAGAGCACCT CGATCGAGCG CGAGAGCACC CGCGCGGCGT TCGAGAAGGC CGGCGTCGTC GTCCTTGTCG GCGACTGGAC CCGGCGCGAC CCGGCGATCA CCCGGTTCCT CACCGCGCAG GGCGTCGCAG GCGTGCCGCT ATATATGTGG TATCCTGCCG GCGGCGCCGA GCCCCGCCAG CTTCCGCAGG TCCTCACGCC CGACATGCTG GCCGCACTGC CGGGGCAATA G
|
Protein sequence | MPIRELVGIK TRLDPLGRII AMLVTAMCLS LLAWLSPVHA APNRMTASLV AEGPVRPGET VTLALLMQPE KGWHGYWSNP GDAGFGLTVE WTLPAGVTAG DMQFPVPQTL VVQGLMNHVY EHDYAVLVPL KVPSSARSGT LLPIRVKAQW LVCTETICVP ERAELQGAVS VGTGPADPRF EGWRAAMPAP LDRPGAFALE SGTLRLALPF PATAPLDAPH LFLKTERVVD YAAPQRFFRQ GDTLVVEVPL AKSPATASRL EGVLAMGRDA GGIAFAAEPG TVPKGGTQVG AGAFSASVLA LALAGAFLGG LILNVMPCVF PILSLKALAL ARGNAHEAHK EGLAYTAGVV LACLLLGGVM LALRAGGEQV GWAFQLQEPG VVAVLMLLAV AITANFAGLY ELPGLSVERG AGATGAFGTG LLAAFVATPC TGPFMAAAMG AALVLPAWAA LGVFAALGLG LALPFLLLGF VPALRRLLPK PGKWMERFRR WMALPMGLTA LALGWLAWRV GGATFTVVLV AIAVVLLGAL AAVGVTQRKG LPIGRLMALA ALIAVGGVIG LPPPAQTAAS EGGLLEARPF SEAALAEARK AGKPVFAYFT ADWCLSCKVN ESTSIEREST RAAFEKAGVV VLVGDWTRRD PAITRFLTAQ GVAGVPLYMW YPAGGAEPRQ LPQVLTPDML AALPGQ
|
| |