Gene Information |
Locus tag | NATL1_10941 |
Symbol | |
ID | 4779316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 999598 |
End bp | 1000638 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640084373 |
Product | hypothetical protein |
Protein accession | YP_001014917 |
Protein GI | 124025801 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0569] K+ transport systems, NAD-binding component |
TIGRFAM ID | |
|
|
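The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 999598
end_bp = 1000638

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length, leftover, protein_length)  # prints: 1041 0 346
```

Under these assumptions the results agree with the listed Gene Length (1041 bp) and Protein Length (346 aa).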
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00275534 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAATCTGA TAAAAACAAA TAAATTTAGA GGCTATTTAA AAGTTTGGGC AGCGCCAATA TCTTTACTCA TATTCTTATT TTTATTCGGT GCTTTAGGCT ATCGATTTAC AGAAGGTTGG GATTGGGGAG ACTGTTTATG GATGGTACTG ATAACCATTA CAACAATAGG CTTTGGTGAA GTTGAAGTTT TGAGTTCGGC TGGGCGGGTT ATAACTTTTT TAATCATCGG AGGGGGATTA TTTGTAGTTC AATTAACTCT TCAAAGATTT ATACAATTAT CTGAACTAGG ATATTTCATA AAATTAGAGG AACTCCGATT AAGGAGATTA ATTAGAAGAA TGAAAAATCA TGTAATTATA TGTGGATATG GTCGTACAGG TAAAGAAATT GCTGACCAAT TATATTCTGA AAAAATATCT ACACTAATAA TAGAAAACGA CCCTACAAGA AAAACCGAAG CTGAGGAAAA AGGATTTAAC GTCTTATTAG CAGATGCAAC AATGGACGAA ACATTATTAC TAGCAGGAGT AAGAAATTGT CGTAGCTTAG TGGTTACTCT TCCAAATGAT GCAGCGAATT TATATGTTGT TCTTAGTGCA AAAGCTCTAA ATAATAGTTG CAGATTGATC GCTAGGGCAG CGAACGAGGA AGCTGCTAGT AAATTAAAAC TTGCAGGAGC TGATGCGGTA GTCAGTCCAT ATGTTGCGGC AGGGAGAACA ATGGCAGCCT CTGCATTAAG ACCTATAGCT GTGGACTTTA TAGATTTACT TGCAGGTTCA GATTGCGAAA TAGAAGAATT TAAGCTTACT GAACATGTTG AAACAATTGA GACTTTCAGA AGTCAACATG AATATGTTTT TGAAATTTCA AAACGAGGCG AAGCACTACT TTTAGCAACT AAAGTCTCGG GTCAATTAAT AGGTAATCCT AAAAACAAGG TATCTATCTC TCCAGGCATG ATTTTGATAT TCCTTGGAAG TCAAGAGCAA TTAAACAGGA TTAGAGTTCA CCTTAAAGAA GTTTTAGTAA AAACAACATA G
|
Protein sequence | MNLIKTNKFR GYLKVWAAPI SLLIFLFLFG ALGYRFTEGW DWGDCLWMVL ITITTIGFGE VEVLSSAGRV ITFLIIGGGL FVVQLTLQRF IQLSELGYFI KLEELRLRRL IRRMKNHVII CGYGRTGKEI ADQLYSEKIS TLIIENDPTR KTEAEEKGFN VLLADATMDE TLLLAGVRNC RSLVVTLPND AANLYVVLSA KALNNSCRLI ARAANEEAAS KLKLAGADAV VSPYVAAGRT MAASALRPIA VDFIDLLAGS DCEIEEFKLT EHVETIETFR SQHEYVFEIS KRGEALLLAT KVSGQLIGNP KNKVSISPGM ILIFLGSQEQ LNRIRVHLKE VLVKTT
|
| |
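The gene sequence starts with TTG, while the protein sequence starts with M. This is consistent with translation table 11, where TTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch of that translation rule, not the pipeline that produced this record; the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 30 nucleotides of the gene are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid mapping shared by NCBI tables 1 and 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-nt groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-nt groups of the gene sequence listed above.
first_30_nt = "TTGAATCTGA TAAAAACAAA TAAATTTAGA"
print(translate(first_30_nt))  # prints: MNLIKTNKFR
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.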