Gene Information |
Locus tag | Saro_2420 |
Symbol | |
ID | 3916739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2598959 |
End bp | 2600401 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640445175 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_497690 |
Protein GI | 87200433 |
COG category | [S] Function unknown |
COG ID | [COG3538] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.226342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGATCG ACCGGCGACG GATGATGGCG GGTGCGGCGG CACTGGGCGG GATGGCCGCG CTTGGCCCGA TGCGCGCCCT CGCCGCGCAG GCTGCCACTG CGACTGCCCG TCCGGAACCT GCCGACCGCC TGTTCGCCAG CCCCGCCGTG GAACGCGAGA TTGCGCGCGT ATCGGCATTG ATCGCCGATG CCGACCTCCG GCGGCTCTTC GTCAACTGCT ATCCGAACAC GCTCGACACC ACGGTCCATC TATCCAGCGT CGAGGGTCGT CCCGACGCTT TCGTCATCAC CGGCGACATC GACTGCATGT GGTTGCGCGA CAGTTCGGCG CAGCTCAATC CCTACCTGCA CCTCGTGCGC GAGGACGAGG CGTTGCGCGG TCTGTTCCGT GGGCTGATCG CGCGGCAGGC GCGCTCGATC CTGATCGACC CCTATGCCAA TGCATTCATG CGCGACCCGT CGGCAAGCAC GAACCTGCCA TGGGCGCTCG CCGACGATAC CGAGATGAAG CCGGGCGTGG CGGAGCGGAA GTGGGAAGTG GATTCGCTCT GCTACCCGAT GCGCCTTGCC CATGACTACT GGAAGGCGAG CGGCGACACC GCGCCGTTCG ACGCGCTCTG GGCCGAGGCG GCCTGGGCCA GCATCCGCAC CTTCCGCGAA CAGCAGCGCA AGGACGACCC CGGCCCCTAC CGCTTCCTGC GCCGCGACAA GCTGGCGACC GAGACGCAGA TCCTTGGCGG CTATGGCGCG CCGACGCGCA AGGTGGGCAT GATCCACAGC ATGTATCGCC CTTCCGACGA TGCCTGCGTG TTTCCCTTCC TGGTGCCGTC GAACCTCTTC GCCGTCGCCG CCCTGCGCAA GCTGGCGGCG CTGGCGGGTG CGGTGCAGCA GGCCAAGCTT GCCAGTGCCG CGCTGGACCT GGCGCGCGAG GTGGAGCTGG CCACCTATGC CAACGGCACG ATCATCGATC CGGCCAGCAA CGAACGGCTC TGGGCCTACG AGGTCGACGG CTTCGGCAAC GGACACTTCA TGGACGATGC CAACGTGCCC AGCCTGTCGA GCCTTGCCTA TCTCGGCGCG GTCCCTTCGG ACGATCCGCT GTTCCTGCGC ACCCGCGCCG CCGCGTGGAG CGAGCGCAAT CCGTACTTCT TCAAGGGCAC CGCCGCGGAA GGCATCGGCG GCCCCCACGC CGGGCTGCGC ATGATCTGGC CGATGGCAAT CACCATGCGC GCGCTGTCGA GCGACGACGA CGCAACGATC CGCCAGTGCC TGGCCATGCT CAAGGCCAGC CACGCCGGCA CCTTCTTCAT CCACGAGGCT TTCGACCAGG ACGATCCGGC GAAGTTCACC CGCCACTGGT TCGCCTGGGC CAACGGCCTG TTCGGAGAGC TGATGATAGA CCTCGCCAAC CGCAAGCCGG CATTGCTGGG AGAAGCCGCA TGA
Protein sequence | MKIDRRRMMA GAAALGGMAA LGPMRALAAQ AATATARPEP ADRLFASPAV EREIARVSAL IADADLRRLF VNCYPNTLDT TVHLSSVEGR PDAFVITGDI DCMWLRDSSA QLNPYLHLVR EDEALRGLFR GLIARQARSI LIDPYANAFM RDPSASTNLP WALADDTEMK PGVAERKWEV DSLCYPMRLA HDYWKASGDT APFDALWAEA AWASIRTFRE QQRKDDPGPY RFLRRDKLAT ETQILGGYGA PTRKVGMIHS MYRPSDDACV FPFLVPSNLF AVAALRKLAA LAGAVQQAKL ASAALDLARE VELATYANGT IIDPASNERL WAYEVDGFGN GHFMDDANVP SLSSLAYLGA VPSDDPLFLR TRAAAWSERN PYFFKGTAAE GIGGPHAGLR MIWPMAITMR ALSSDDDATI RQCLAMLKAS HAGTFFIHEA FDQDDPAKFT RHWFAWANGL FGELMIDLAN RKPALLGEAA
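The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length counts the terminal stop codon (the gene sequence ends in TGA). The function names `cds_span` and `expected_protein_length` are hypothetical and not part of any database API.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1


# Values copied from the record for Saro_2420.
START_BP = 2598959
END_BP = 2600401
GENE_LENGTH_BP = 1443
PROTEIN_LENGTH_AA = 480

span = cds_span(START_BP, END_BP)
print(span == GENE_LENGTH_BP)
print(expected_protein_length(span) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the coordinates span 1443 bp, which is 481 codons, or 480 residues once the stop codon is excluded.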