Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3451 |
Symbol | |
ID | 5059920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 3957818 |
End bp | 3959212 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640475700 |
Product | CBS domain-containing protein |
Protein accession | YP_001160260 |
Protein GI | 145595963 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.553184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0782726 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCCT CTCTTGCCGC TGCGACGGCC GGCGCTGCCG GCCTCCCCGA CGTGCAACTG GTCTTCCTCG CGGCCGGGCT GGTGGTCCTC GCCGGCCTGA TTGGGATGAC CGAGGCGGCG CTCGCCGCCG TCTCTCCGGC CCGCGCGGCC GAGTTGGTCC GCGACGGAGC CCGGGGTGCC CGAGCGTTAC AGTCCGTCGC CGGTGATGTC GTCCGGCACC TCAACCTACT GCTGCTGTTG CGGCTGCTCG CCGAGCTGAG CGCGACCACC CTGGTGGCTC TGGTCGCGGT CGACTCGCTC GGCGCGGGCT GGCGGGCCGC GGTGGTCACG GCCGGGGCGA TGACCGTGGT CAGCTTCGTG GTGGTCGGCG TCGGGCCGCG CACCATCGGG CGTCAGCACG CCTATGCGGT GGGGCGTGGG GTGGCGCCGC TGGTGCGTTG GCTGGGTCGG GCCCTCAACC CCCTCGCCTC TCTGTTGATC CTGATCGGTA ATGCGGTCAC GCCGGGGCGC GGCTTCCGGG AGGGTCCGTT CGCCACCCAG GTGGAGTTGC GCGAGCTGGT CGACCTGGCC GAGCAGCGCG GCGTGGTGGA GCATGGCGAG CGGCAGATGA TCCATTCCGT CTTCGCGCTC GGTGACACCA TCGTCCGCGA GGTGATGGTG CCGCGCACCG AGATGGTCTG GATCGAGCGC CACAAGATGC TCTCCCAGGC CCTGGCGCTC TTTCTGCGTT CCGGCTTCTC CCGCATCCCG GTCATCGGCG AGAGCGTCGA CGATGTGCTC GGCGTGCTCT ACCTGAAAGA TCTGATTCGG CGTACGCAGG GCGGCGCCCC GGAGGACCGG CGTCTCCAGG TAGCCGAGTT GATGCGCCCG GCCACCTTCG TGCCGGAGTC CAAGCCGGTT GACGACCTTC TCTCGGAGAT GCAGGCTGCC CGGAACCACC TGGTGATCGT CGTTGACGAG TACGGCGGGA CGGGCGGCCT GGTCACCATC GAGGACATCC TGGAGGAGAT CGTCGGCGAG ATAACCGACG AGTACGATGT TGAGCGTCCG CCGGTCGAGC ACCTCGACGA CGACGCGGTG CGGGTCACCG CGCGGCTACC CGTGGAGGAC CTCGGCGAGT TGTTCGACAC CGAGCTGCCC AGCGATGAGG TGGAGACGGT GGGCGGCCTG CTCGCGCAGT CCCTGGGCCG GGTTCCGATC CCCGGCGCCC AGGTTGAGGT GGCCGGTCTA CGGCTGCTCG CGGAGGGCAC CACCGGGCGG CGCAACCGGA TCGACACGGT CCTGGTGCGC CGGGTGGAGC CGGCCGACCA ACAGCACGAT CCGGGGCGCG GCGACACCGC CGACGTCCCG GGCGACACCG ACCAAGCCGA GGAGAGGCAG CCCGCCGATG CCTGA
|
Protein sequence | MMASLAAATA GAAGLPDVQL VFLAAGLVVL AGLIGMTEAA LAAVSPARAA ELVRDGARGA RALQSVAGDV VRHLNLLLLL RLLAELSATT LVALVAVDSL GAGWRAAVVT AGAMTVVSFV VVGVGPRTIG RQHAYAVGRG VAPLVRWLGR ALNPLASLLI LIGNAVTPGR GFREGPFATQ VELRELVDLA EQRGVVEHGE RQMIHSVFAL GDTIVREVMV PRTEMVWIER HKMLSQALAL FLRSGFSRIP VIGESVDDVL GVLYLKDLIR RTQGGAPEDR RLQVAELMRP ATFVPESKPV DDLLSEMQAA RNHLVIVVDE YGGTGGLVTI EDILEEIVGE ITDEYDVERP PVEHLDDDAV RVTARLPVED LGELFDTELP SDEVETVGGL LAQSLGRVPI PGAQVEVAGL RLLAEGTTGR RNRIDTVLVR RVEPADQQHD PGRGDTADVP GDTDQAEERQ PADA
|
| |