Gene NATL1_04191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04191 
Symbol 
ID4780814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp383719 
End bp385170 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content40% 
IMG OID640083689 
Productputative neutral invertase-like protein 
Protein accessionYP_001014248 
Protein GI124025132 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.659248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCAC GTTTTAGCCA ACAGAACCAA AGGGTACGCC CCAACTCGAA TGAAGACAAA 
GTTGTTGCAA GAGCTAAAGA ACATTTTGAA AAAACTCTGA TTCAAATTTC TGGAGATATT
GCAGGTAGTG TCGCAGCCTT AGAGCATCCA ACAAAAAATG ATGCTCTTAA TTACGGAGAA
ATCTTTTTAA GGGATAACGT CCCTGTAATG ATCTATCTCT TAACTCAAAA AAGATATGAC
ATAGTCAAAA AATTTTTGAC AGTAAGTCTC GATTTGCAAA GTACCACTTA TCAAACTCGA
GGGGTTTTCC CGACAAGTTT TGTTGAAGAA AAAGGGAAAT TAATAGCAGA TTATGGTCAG
AGGTCTATTG GAAGAATTAC CTCTGCAGAT GCAAGCTTAT GGTGGCCAGT ACTTTGTTGG
CTGTATGTGA GAAAAAGTGG TGATCAAAGT TTTGGGACGA GTCAACAAGT GCAAAGAGGT
GTGCAACTTC TTCTTGATTT AGTTTTGCAT CCAACCTTCG AGGGGAATCC AGTTTTATTC
GTGCCAGATT GTTCATTTAT GATCGATCGA CCCATGGATG TTTGGGGAGC TCCTCTTGAG
GTAGAGGTTT TGTTGCATGC ATCTTTAAAA AGCTGCATAC AACTAATGGA ACTAAGCAGA
AAACATCAAA AAAGTCGATT GCTCGATCAG AGGCTTGTTT TAACACGTCA GTGGGTGCAT
GATCTTCGCC AATTTCTTTT GAAGCATTAT TGGGTTACAA GCAAAACGAT GCAAGTTCTT
AGAAGAAGAC CTACAGAACA ATATGGTGAA GATCAACATC AAAATGAATT CAATGTCCAA
CCCCAAGTAG TCCCTTCATG GCTTCAGGAT TGGCTTGAAA ATAGAGGTGG ATACCTTATT
GGAAATATAA GAACTGGTAG GCCAGATTTT CGTTTTTACA GCTTGGGGAA TTCATTAGCA
TGCATGTTTG GAGTATTGAC AGCACCTCAA CAAAGAGCAT TGTTTCGTCT GGTTCTACAT
AACCGTGAAC ACCTCATGGC ACAAATGCCA ATGAGGATAT GCCATCCTCC AATGGATGTC
GAAGAATGGC AAAACAAAAC AGGTTCAGAC CCAAAAAACT GGCCATGGAG CTACCACAAT
GGGGGCCATT GGCCTAGTCT TTTATGGTTT TTTGGTGCTT CTATATTGCT TCATGAAAAA
CGATATCCAA AAGCTGATGT TTTACTCATG GGACAAATGA GGGCTCTTAT TGAAGAATGT
TATTGGAGTC AATTGAACCA ACTCCCTAGA CAAAAATGGG CAGAATATTT TGATGGGCCA
ACTGGCACTT GGGTAGGACA ACAATCTAGG ACTTATCAAA CATGGACTAT TGTTGGATTT
TTATTAATGC ATCATTTGTT AAGAGCTGAA CCTGATGATG TATTGATGCT GGACTTAGAG
GAAGAATTCT AA
 
Protein sequence
MPARFSQQNQ RVRPNSNEDK VVARAKEHFE KTLIQISGDI AGSVAALEHP TKNDALNYGE 
IFLRDNVPVM IYLLTQKRYD IVKKFLTVSL DLQSTTYQTR GVFPTSFVEE KGKLIADYGQ
RSIGRITSAD ASLWWPVLCW LYVRKSGDQS FGTSQQVQRG VQLLLDLVLH PTFEGNPVLF
VPDCSFMIDR PMDVWGAPLE VEVLLHASLK SCIQLMELSR KHQKSRLLDQ RLVLTRQWVH
DLRQFLLKHY WVTSKTMQVL RRRPTEQYGE DQHQNEFNVQ PQVVPSWLQD WLENRGGYLI
GNIRTGRPDF RFYSLGNSLA CMFGVLTAPQ QRALFRLVLH NREHLMAQMP MRICHPPMDV
EEWQNKTGSD PKNWPWSYHN GGHWPSLLWF FGASILLHEK RYPKADVLLM GQMRALIEEC
YWSQLNQLPR QKWAEYFDGP TGTWVGQQSR TYQTWTIVGF LLMHHLLRAE PDDVLMLDLE
EEF