Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_04191 |
Symbol | |
ID | 4780814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 383719 |
End bp | 385170 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640083689 |
Product | putative neutral invertase-like protein |
Protein accession | YP_001014248 |
Protein GI | 124025132 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.659248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCAC GTTTTAGCCA ACAGAACCAA AGGGTACGCC CCAACTCGAA TGAAGACAAA GTTGTTGCAA GAGCTAAAGA ACATTTTGAA AAAACTCTGA TTCAAATTTC TGGAGATATT GCAGGTAGTG TCGCAGCCTT AGAGCATCCA ACAAAAAATG ATGCTCTTAA TTACGGAGAA ATCTTTTTAA GGGATAACGT CCCTGTAATG ATCTATCTCT TAACTCAAAA AAGATATGAC ATAGTCAAAA AATTTTTGAC AGTAAGTCTC GATTTGCAAA GTACCACTTA TCAAACTCGA GGGGTTTTCC CGACAAGTTT TGTTGAAGAA AAAGGGAAAT TAATAGCAGA TTATGGTCAG AGGTCTATTG GAAGAATTAC CTCTGCAGAT GCAAGCTTAT GGTGGCCAGT ACTTTGTTGG CTGTATGTGA GAAAAAGTGG TGATCAAAGT TTTGGGACGA GTCAACAAGT GCAAAGAGGT GTGCAACTTC TTCTTGATTT AGTTTTGCAT CCAACCTTCG AGGGGAATCC AGTTTTATTC GTGCCAGATT GTTCATTTAT GATCGATCGA CCCATGGATG TTTGGGGAGC TCCTCTTGAG GTAGAGGTTT TGTTGCATGC ATCTTTAAAA AGCTGCATAC AACTAATGGA ACTAAGCAGA AAACATCAAA AAAGTCGATT GCTCGATCAG AGGCTTGTTT TAACACGTCA GTGGGTGCAT GATCTTCGCC AATTTCTTTT GAAGCATTAT TGGGTTACAA GCAAAACGAT GCAAGTTCTT AGAAGAAGAC CTACAGAACA ATATGGTGAA GATCAACATC AAAATGAATT CAATGTCCAA CCCCAAGTAG TCCCTTCATG GCTTCAGGAT TGGCTTGAAA ATAGAGGTGG ATACCTTATT GGAAATATAA GAACTGGTAG GCCAGATTTT CGTTTTTACA GCTTGGGGAA TTCATTAGCA TGCATGTTTG GAGTATTGAC AGCACCTCAA CAAAGAGCAT TGTTTCGTCT GGTTCTACAT AACCGTGAAC ACCTCATGGC ACAAATGCCA ATGAGGATAT GCCATCCTCC AATGGATGTC GAAGAATGGC AAAACAAAAC AGGTTCAGAC CCAAAAAACT GGCCATGGAG CTACCACAAT GGGGGCCATT GGCCTAGTCT TTTATGGTTT TTTGGTGCTT CTATATTGCT TCATGAAAAA CGATATCCAA AAGCTGATGT TTTACTCATG GGACAAATGA GGGCTCTTAT TGAAGAATGT TATTGGAGTC AATTGAACCA ACTCCCTAGA CAAAAATGGG CAGAATATTT TGATGGGCCA ACTGGCACTT GGGTAGGACA ACAATCTAGG ACTTATCAAA CATGGACTAT TGTTGGATTT TTATTAATGC ATCATTTGTT AAGAGCTGAA CCTGATGATG TATTGATGCT GGACTTAGAG GAAGAATTCT AA
|
Protein sequence | MPARFSQQNQ RVRPNSNEDK VVARAKEHFE KTLIQISGDI AGSVAALEHP TKNDALNYGE IFLRDNVPVM IYLLTQKRYD IVKKFLTVSL DLQSTTYQTR GVFPTSFVEE KGKLIADYGQ RSIGRITSAD ASLWWPVLCW LYVRKSGDQS FGTSQQVQRG VQLLLDLVLH PTFEGNPVLF VPDCSFMIDR PMDVWGAPLE VEVLLHASLK SCIQLMELSR KHQKSRLLDQ RLVLTRQWVH DLRQFLLKHY WVTSKTMQVL RRRPTEQYGE DQHQNEFNVQ PQVVPSWLQD WLENRGGYLI GNIRTGRPDF RFYSLGNSLA CMFGVLTAPQ QRALFRLVLH NREHLMAQMP MRICHPPMDV EEWQNKTGSD PKNWPWSYHN GGHWPSLLWF FGASILLHEK RYPKADVLLM GQMRALIEEC YWSQLNQLPR QKWAEYFDGP TGTWVGQQSR TYQTWTIVGF LLMHHLLRAE PDDVLMLDLE EEF
|
| |