Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_10371 |
Symbol | |
ID | 4780610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 955387 |
End bp | 956133 |
Gene Length | 747 bp |
Protein Length | 248 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640084316 |
Product | putative CbbY-like protein |
Protein accession | YP_001014860 |
Protein GI | 124025744 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.549559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000687535 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATAAAT TAAAAGCAGT CTTTTGGGAT GTGGATGGAA CAATTGCCGA TACAGAATTG TGTGGGCATA GAGTCGCTTT TAATTTGGCC TTTAAGGATT TTGATTTGGA TTGGAACTGG AATGAGAGCC AATATTTGGA TCTTCTAAAA ATATCGGGAG GTTTTAACCG CATAATCCAC TATCGAAACA AAATCGATAG TGACATTACT GAAAGCAAAT GTTCCGAGAT TCAAGCCAGG AAGCGTATTC ATTACAAGAA ATTAATTCAA TCTGGCAAAA TTAAAGTAAG AGAGGGAGTC CTGAGGCTTA TTAATGAACT TCATAACTCT GATATAGAAC AATTTATTGT TACGACCAGT GGAAAAGATT CTCTTGATCC TTTTCTAAAA ACCTCATTGA GTTCACATTT AAATTATTTT TCCGGATTTA TTACATATGA AGATGTAAGC AGGCACAAGC CCTTCCCTGA TGCATATAAG CTAGCGCTTA AATTAAGTAA GCAATCACAA TTTAATTGCA TAGCTATTGA GGACTCTAAG ATTGGCGTTG AATCTGCTAA GGCAGCTAAT CTTAATTGCC TTTTGATTTT GCCTCCGTGG AATAGTTCTA AGCAAAATAT TTCTAAAAAA GCTAATGCAT GTTTAAATAG CCTTGGTAAT TTTGATAATC CATCTAGGCT AATTTATGGT AAAAAGCTAA TTAGTGATCA TGTGGATTTT GATTATTTGA CAAATATTAT AAATTAA
|
Protein sequence | MHKLKAVFWD VDGTIADTEL CGHRVAFNLA FKDFDLDWNW NESQYLDLLK ISGGFNRIIH YRNKIDSDIT ESKCSEIQAR KRIHYKKLIQ SGKIKVREGV LRLINELHNS DIEQFIVTTS GKDSLDPFLK TSLSSHLNYF SGFITYEDVS RHKPFPDAYK LALKLSKQSQ FNCIAIEDSK IGVESAKAAN LNCLLILPPW NSSKQNISKK ANACLNSLGN FDNPSRLIYG KKLISDHVDF DYLTNIIN
|
| |