Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3830 |
Symbol | |
ID | 5704854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4361985 |
End bp | 4363379 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273252 |
Product | CBS domain-containing protein |
Protein accession | YP_001538614 |
Protein GI | 159039361 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0304655 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGGCT CTTCCGCCGC AACAACGGTC GGCTCTGCCG GCCTTCCCGA CGTGCAGCTG ATCGTGGTCG CGGCCGGGCT GGTGGTCCTC GCCGGCCTGA TCGCGATGAC GGAGGCCGCG CTCTCCGCCG TCTCTCCGGC ACGCGCCGCC GAACTGGCCC GCGATGGCGC CCGTGGCGCC CGAGCGTTGC AGTCCGTCGC GAGTGACGTG GTCCGGCACC TCAACCTGCT GCTGCTGTTG CGGCTGCTCA CCGAGCTGAG CGCGACCACT CTGGTGGCGC TGGTCGCGGT CGACTCGTTC GGCGCTGGTT GGCGGGCCGC GCTGGTGACG GCCGGGGCGA TGACCGTGGT CAGCTTCGTG GTGGTCGGCG TCGGGCCGCG CACGATCGGC CGGCAGCATG CCTACGCGGT GGGTCGCGGC GTGGCGCCGC TGGTGCGTTG GCTGGGTCGG GCGCTCAACC CACTCGCCTC CCTGCTGATC CTGATCGGCA ACGCGGTCAC CCCGGGGCGG GGCTTCCGGG AGGGGCCCTT CGCCACCCAG GTGGAGCTGC GCGAACTGGT GGACCTGGCC GAGCAGCGCG GTGTGGTGGA GCATGGCGAG CGGCAGATGA TCCACTCCGT CTTCGCGCTC GGCGACACCA TCGCCCGCGA GGTGATGGTG CCGCGTACCG AGATGGTGTG GATCGAGCGG CACAAGATGC TGTCCCAGGC CCTGGCGCTC TTTCTGCGGT CCGGCTTCTC TCGGATTCCG GTGATCGGCG AGAGCGTCGA CGACGTGCTC GGCGTGCTCT ATCTGAAGGA TCTGATCCGG CGCACGCAGG GCGGTGCCCC GGAGGACCGA CGCCTCCCCG TGGCCGAGCT GATGCGTCCG GCCACCTTCG TGCCGGAATC CAAGCCGGTC GACGACCTGC TCTCGGAGAT GCAGGCCGCC CGGAACCACC TGGTAATCGT CGTTGACGAG TACGGCGGTA CCGGCGGGTT GGTCACCATC GAGGACATCC TGGAGGAGAT CGTCGGCGAG ATCACCGACG AGTACGATGT CGAGCGCCCA CCGGTCGAGC GCCTCGACGA CGACGCGGTG CGGGTCACCG CGCGGCTTCC CGTGGATGAC CTCGGCGAGT TGTTCGACAC CGAGCTGCCC GGCGACGAGG TGGAGACGGT GGGCGGACTG CTCGCGCAGT CGCTGGGCCG GGTTCCGATC CCCGGTGCCC AGGTCGAGGT GGCTGGTCTG CGGCTGCTCG CCGAGGGCAC CACCGGCCGG CGCAACCGGA TCGACACGGT GCTGGTGCGC CGGGTGGAGC CGGCCGACCA GCAGCACGAT CCGGGTCGCG GTGAACCGAC CGAGACCCGG GACGACACCG ACCAAGCCGA GGAGAGGCAA CCCGCCGATG CCTGA
|
Protein sequence | MMGSSAATTV GSAGLPDVQL IVVAAGLVVL AGLIAMTEAA LSAVSPARAA ELARDGARGA RALQSVASDV VRHLNLLLLL RLLTELSATT LVALVAVDSF GAGWRAALVT AGAMTVVSFV VVGVGPRTIG RQHAYAVGRG VAPLVRWLGR ALNPLASLLI LIGNAVTPGR GFREGPFATQ VELRELVDLA EQRGVVEHGE RQMIHSVFAL GDTIAREVMV PRTEMVWIER HKMLSQALAL FLRSGFSRIP VIGESVDDVL GVLYLKDLIR RTQGGAPEDR RLPVAELMRP ATFVPESKPV DDLLSEMQAA RNHLVIVVDE YGGTGGLVTI EDILEEIVGE ITDEYDVERP PVERLDDDAV RVTARLPVDD LGELFDTELP GDEVETVGGL LAQSLGRVPI PGAQVEVAGL RLLAEGTTGR RNRIDTVLVR RVEPADQQHD PGRGEPTETR DDTDQAEERQ PADA
|
| |