Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3922 |
Symbol | |
ID | 5703773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4464757 |
End bp | 4465857 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641273347 |
Product | hypothetical protein |
Protein accession | YP_001538704 |
Protein GI | 159039451 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01140] L-threonine-O-3-phosphate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.753113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0645369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGCGC AGCCGATCGC GGGACGACCT GGGCCGCCGA TCATCCCGTC GGCGCCCGAG CCCGACCTCG GCCATCACGG GGATGCCGAG GCCACCCCCG GTTTGGTTGA TCTTGCCGTG AACGTGCGCC GAGCCCCGAT GCCGGAATGG CTCGCCGACC CGATCACCGC CGCGCTCGGC GACCTCGCCG GATACCCGGA CCCAACCCCG GCGCGGGCCG CCGTGGCTGC CCGGCATCGC CGACCGCCAG CCGAGGTGCT GCTCACCACC GGCGCCGCCG AGGGCTTCGT GCTCGTTGCC CAGGCATTGC GTGGGATCCA CCGCCCGGTG GTTGTGCACC CGCAGTTCAC CGAGCCGGAG GCAGCCCTGC GGGCGTCCGG TCACCAGGTC GAGCGGGTGC TGCTCGACCC CGACGACGGG TTCCGACTCG ACCCCGCCCG CATCCCGGTG GACGCCGACC TGGTCATGAT CGGTAACCCC ACGAACCCGA CCTCGGTGCT GCACCCGGCT GCCGATGTGG CCGCGCTCGC CCGGCCCGGC CGCGTCCTCG TCGTCGACGA GGCGTTCGCC GACACCACCA TCGCACCCGG GGGAGCCGGC GAGCCCGAGT CGCTCGCCGG CCGCGGCGAC CTACCCGGCC TGCTGGTCAT CCGAAGCCTC ACCAAGACGT GGGGGCTGGC CGGGCTGCGC GTCGGCTACC TGCTCGGTGC GGCGGACCTG CTGGATCGAC TGGCCGCCGT GCAGCCGCTG TGGGCGGTCT CCACCCCGGC CCTCGCCGCC GCGACGGCCT GCGCCGCGCC CGAGGCGGTG CGAGCCGAAC GCTTGATCGC CGCCCGCCTC GCCGCCGACC GCGACCACCT GGTCGCCCGC CTGGCCGCCC TGCCGGGAGT ACGCGTCGTT GGCCAACCGG CAAGCGCCTT CGTCCTCGTT CACTGGCCGG GCGCCGACGC GGTCCGCCGT GCCCTGCGGG AACGCGGCTG GGCCGTACGC CGCGGCGACA CGTTCCCCGG ACTGGGGCCG GACTGGCTAC GGATCGCAGT CCGTGACCGG GCAACCACCG ACGCGTTCAT CACGGTGCTG GCGCAGATCC TGGAGGCATG A
|
Protein sequence | MRAQPIAGRP GPPIIPSAPE PDLGHHGDAE ATPGLVDLAV NVRRAPMPEW LADPITAALG DLAGYPDPTP ARAAVAARHR RPPAEVLLTT GAAEGFVLVA QALRGIHRPV VVHPQFTEPE AALRASGHQV ERVLLDPDDG FRLDPARIPV DADLVMIGNP TNPTSVLHPA ADVAALARPG RVLVVDEAFA DTTIAPGGAG EPESLAGRGD LPGLLVIRSL TKTWGLAGLR VGYLLGAADL LDRLAAVQPL WAVSTPALAA ATACAAPEAV RAERLIAARL AADRDHLVAR LAALPGVRVV GQPASAFVLV HWPGADAVRR ALRERGWAVR RGDTFPGLGP DWLRIAVRDR ATTDAFITVL AQILEA
|
| |