Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1722 |
Symbol | |
ID | 5703421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1993847 |
End bp | 1994767 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641271225 |
Product | hypothetical protein |
Protein accession | YP_001536600 |
Protein GI | 159037347 |
COG category | [R] General function prediction only |
COG ID | [COG1938] Archaeal enzymes of ATP-grasp superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.901753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000637714 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCTCGACC CACACGAGCT CTACGAACTC GCCGACGATC TGCCCGACCT CGGGCAGCCC GTCCTGATCC AGGCGCTCTC CGGCTTCGTC GACGCCGGTA ACGCCACCCG GTTGGCCCGC GAGCAACTGC TCACCTCGCT CGATGCGCGG CCGGTGGCCC GGTTCGACCT GGACCAGCTC TTCGACTACC GCTCGCGCCG GCCGGTGATG ACCTTCGTCG AGGACCACTG GGAGTCCTAC GACGCTCCGG CCCTGGAACT GCACCTGCTC CGCGACGACG CCGACACCCC GTTTCTCCTG CTCACCGGCC CGGAGCCGGA CCTGCAGTGG GAACGTTTCG TCGCCGCCGT GGCCGGGCTC GCCACCCGGC TGGACGTCCG GCTGACCGTC GGGCTCAACG CCATCCCGAT GGCGGTGCCA CACACCCGCC GCACCGGTGT CACCGCGCAC GCCACCCGGC GTGAGCTGAC CGCAGGCTAC GAGCCGTGGC TGCAACGTGT GCAGGTGCCG GGCAGCGTGG GATACCTGCT CGAATACCGC CTCGGCGAGC AGGGGCGCGA CGCACTGGGC TTCGCCGCAC ACGTGCCGCA CTACGTCGCG CAGACCGAGT ACCCCGCCGC GGCCGAGGTG CTGCTCTCCT CGGTGTCGCG CAGCACCGGG CTACTCCTGC CCTGCGACGA ACTGCGGGCC GCGACCGAGG CGGTCCGGAC GGAAATCGAC CGACAGGTCG CCCAGACCGA GGACGCCGCG GCGCTGGTTC AGGCGCTCGA GGAACAGTAC GACGCCTTCA CCCGCGGGCG CGGCCAGCCG AACCTCCTCA ACACCGGGGC GGGGTCCCTG CCGACCGCCG ACGAACTCGG CGCCGAGTTG GAACGCTTCC TGGCCGAGCA GACCCGCCCC AACGACAACC CGGGCGGCTG A
|
Protein sequence | MLDPHELYEL ADDLPDLGQP VLIQALSGFV DAGNATRLAR EQLLTSLDAR PVARFDLDQL FDYRSRRPVM TFVEDHWESY DAPALELHLL RDDADTPFLL LTGPEPDLQW ERFVAAVAGL ATRLDVRLTV GLNAIPMAVP HTRRTGVTAH ATRRELTAGY EPWLQRVQVP GSVGYLLEYR LGEQGRDALG FAAHVPHYVA QTEYPAAAEV LLSSVSRSTG LLLPCDELRA ATEAVRTEID RQVAQTEDAA ALVQALEEQY DAFTRGRGQP NLLNTGAGSL PTADELGAEL ERFLAEQTRP NDNPGG
|
| |