Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0409 |
Symbol | |
ID | 5703802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 469796 |
End bp | 470716 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641269934 |
Product | proline dehydrogenase |
Protein accession | YP_001535329 |
Protein GI | 159036076 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00302013 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCCGTA CCGTCATCCT TGCCGCGGCC CGGTCATCCC GGTGCGAGCA GTTCGTCGCG ACGGCCCCAT ACACCCGGGA CATCGTGCGT CGGTTCGTCG CCGGCGCCCG AACCGACGAC GCGTTGCGTG TCACCCGCGC CCTCGTCCAG GACGGGCTCG CGGTCACCCT CGACAACCTC GGCGAGGACA CGGTCACGCC CGAGCAGGCC ACCGCCGTCC GCGACGAGTA CCTCAACCTG CTGGAGTTGC TCGCCGCCGC CGGGCTCACG CCGGCCACCG AGGTGAGCGT CAAGCTCTCC GCGCTCGGGC AGACGTTTGA CGAGCAGCTC GCGTACGACC ACGCGCGGGC GATCTGCGTG GCCGCCGACA CTGCGGGCAC CATGGTCACC CTCGACATGG AGGACCACAC CACCACCGAC TCGACCCTGG ACATCCTCCT CAAGCTGCGC GAAGACCACC CGTCGACCGG GGCGGTGCTA CAGGCGTACC TGCGGCGGAC GGAGTCGGAC TGCCGGCAAC TGGCCGGCGC GGGCTCCCGG GTCAGGCTGT GCAAGGGCGC CTACCGGGAA CCCGAGTCGG TGGCCTACCA GTCTGCTCAC GACGTGGACA GGTCGTACGT ACGCTGCCTG AACATCCTGA TGTCCGGTGC CGGCTACCCG ATGCTCGCCA CCCACGACCC TCGCCTGATC GCGATCGGCG AGGACCGGGC CCGCTGGTTC GACCGGGGGC CGGACCGGTT CGAGTTCCAG ATGCTCTACG GCATCCGCCC CGAGGAACAG GCCCGCCTGG TCGGCGACGG GTACACCGTG CGAACGTATG TCCCGTACGG CGACCAGTGG TACGGCTATC TCATGCGTCG CCTGGCCGAG CGTCCCGCCA ATCTGGCCTT CTTCGCCCGC GCCCTGATGC ACCAGGGGTA G
|
Protein sequence | MLRTVILAAA RSSRCEQFVA TAPYTRDIVR RFVAGARTDD ALRVTRALVQ DGLAVTLDNL GEDTVTPEQA TAVRDEYLNL LELLAAAGLT PATEVSVKLS ALGQTFDEQL AYDHARAICV AADTAGTMVT LDMEDHTTTD STLDILLKLR EDHPSTGAVL QAYLRRTESD CRQLAGAGSR VRLCKGAYRE PESVAYQSAH DVDRSYVRCL NILMSGAGYP MLATHDPRLI AIGEDRARWF DRGPDRFEFQ MLYGIRPEEQ ARLVGDGYTV RTYVPYGDQW YGYLMRRLAE RPANLAFFAR ALMHQG
|
| |