Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3547 |
Symbol | |
ID | 5060021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4066176 |
End bp | 4067366 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640475801 |
Product | hypothetical protein |
Protein accession | YP_001160356 |
Protein GI | 145596059 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01140] L-threonine-O-3-phosphate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGAAATA TTCATGAACC CGTCGCCCGT GACGCGGCCC ACGTCGCAAT GCGCTGTGCC ACGCGCTCCC TTCCCGGCAG GATCGCGGTC ATGCGTACGC AGTCGAGCGG GCGAGGGGCC GCGCCGTCGG ACAACCCGGC GGCACCCGAA CTCGACCTCG GCCACCATGG AGATGTCGAG GCCGTACCCG GTCTGGTGGA CCTCGCGGTG AACGTGCGCC GGGCCCCGAT GCCGGACTGG CTGGCCGACC CGATCACCGC CGCCCTCGGT GACCTGGCCC GATACCCGGA CTCCGCGGCG GCGCGGGCTG CTGTTGCCGC CCGGCACGGT CGGCCGCCGG CCGAGGTGCT GCTCACCGCC GGCGCTGCCG AGGGCTTCGT CCTGATCGCC CAGGCGCTGC GCGGGATCCG CCGCCCGGTG GTGGTGCACC CGCAGTTCAC CGAGCCGGAG GCGGCCCTGC GCGCGGCTGG GCACCGGGTC GAGCGGGTGC TGCTCGACCC CGAAGACGGG TTCCGGCTCG ACCCCACCCG GGTCCCGGCG GATGCCGACC TGGTCATGAT CGGTAACCCC ACCAACCCGA CCTCGGTGCT GCACCCGGCC GCCGACGTGG CCGCACTCGC CCGGCCCGGC CGGGTTCTCG TCGTGGACGA GGCGTTCGCC GACACCACCC TCACGCCCGG AGTCGCTGGC GAGCCGGAGT CGCTTGCTGC CCGGCCCGAC CTGCCTGGCC TGCTGGTTGT TCGGAGCCTC ACCAAGACGT GGGGGCTGGC CGGGCTGCGC GTCGGCTACC TGCTCGGTGC ACCGGATCTG CTGGACCGCC TGGCCGCCGT GCAGCCGCTG TGGGCGGTCT CCACCCCGGC CCTCGCCGCC GCGACAGCCT GCGCCGGACC CGAAGCGGTG CGAGCCGAAC GCCAGATCGC TGCCGACCTC GCCGCCGATC GCGAATACCT CCTCACCCGC CTGGCGGCCC TACCGGGAAT ACGCGTCGTC GGCCAGCCGG CGAGCGCCTT CGTTCTCGTC CACCGCCCCG GCGCTGACGC GGTACGCCGC GCCCTGCGAG AGCGAGGCTG GGCAGTACGC CGCGGCGACA CCTTTCCCGG ACTGGGTCCG GAATGGCTGC GGATCGCGGT CCGCGATCCG GCGACGACCG ACGCGTTTAC CACCATGCTG GCAGAGGTTC TGGAGGGATG A
|
Protein sequence | MRNIHEPVAR DAAHVAMRCA TRSLPGRIAV MRTQSSGRGA APSDNPAAPE LDLGHHGDVE AVPGLVDLAV NVRRAPMPDW LADPITAALG DLARYPDSAA ARAAVAARHG RPPAEVLLTA GAAEGFVLIA QALRGIRRPV VVHPQFTEPE AALRAAGHRV ERVLLDPEDG FRLDPTRVPA DADLVMIGNP TNPTSVLHPA ADVAALARPG RVLVVDEAFA DTTLTPGVAG EPESLAARPD LPGLLVVRSL TKTWGLAGLR VGYLLGAPDL LDRLAAVQPL WAVSTPALAA ATACAGPEAV RAERQIAADL AADREYLLTR LAALPGIRVV GQPASAFVLV HRPGADAVRR ALRERGWAVR RGDTFPGLGP EWLRIAVRDP ATTDAFTTML AEVLEG
|
| |