Gene NATL1_21821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21821 
SymboleriC 
ID4780285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1841752 
End bp1843110 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content36% 
IMG OID640085480 
Productputative chloride channel 
Protein accessionYP_001016002 
Protein GI124026887 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0038] Chloride channel protein EriC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.607028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAAA GTAAAAGTAA AAATCCAATA CAAAAAAAAT CAACTCAAAG CATAAAAAAA 
CTTCTACAGA GGAAATGGCT AAATGTAATT CTTGCTCTCA TACTTACAGG TTTAGGAGCA
GCATTAACGG GAATATTATT TAAAACTGGG ATTCATACAT TAGAAGATTA TCGATCGAAT
CTTCTTGCAT CTATGCCTAG ATGGATAGTT TTACCAATAC TAGGAGCATT GGGAGGCTTG
ATTTCTGGAT CGCTCATTCA AAATTTTGCC CCTGCAGCAA AAGGAGCGGG CGTAAGTCAC
ATCATTGCTT TCCTTCGCCA TAAGCCAGTC CCCATGGGAT TAAGAGTTGG AATAGTAAAA
CTGTTTGCTG GAATTATTGC TATTGGAAGT GGATTTCCGC TAGGCCCAGA AGGCCCGGCA
GTACAAATGG GAGGATCAGT TGCCTGGAAA ATGGCAAAAT GGCTCAAAGC CCCTATTTCA
TTTCGCAGAG TCATAGTTGC AGCAGGCGGA GGAGCTGGAA TTGCAGCAAT TTTTAGTGCG
CCTATTGGAG GCTTTGTCTA TGCAATTGAA GAACTTCTAA ATTCAGCCAG ACCAGTAGTT
CTCTTACTAG TAGTAGTAAC AACTTTCTGG GCTGATAGTT GCGCTGATAT TTTGCAAGCG
ATTGGACTAG ATCAAAAAGC TGGTGGATTT GTTAATAATT TAGGTTTTCA ATTAGAGAGA
GCTTACACAC CAGTAATCGA ATTCTTTCCT ATAGATTTTC TATACCTAAT TTTTTTAGGA
ATTGTATTAG GATTATTAGC AGAAATGTAT TGCCGATATG TTTTAAAAAT GCAAGTTCTA
GGAAATAAAT GGTTTAAAAA TAAAACCATT GAAAGAATGA GTTTATCGGG TTTATTGCTT
GGTACTATGT ATTCAATTAT TCCGGAAAAT TTTCACAATA TCGAAGGATT ACAAAAAATT
ATTGTTAATG AAAGTAGCAA TTTCACACTT GCAATAAGTA TATTTTTTAT TTTATTCTTT
GCCTCTGGCC TTGCTGCTGC TTCAGGTGCA CCAGGAGGTT TATTTTATCC AATGCTTACC
TTGGGTGGAG CAATAGGTTT AGCAAGTGGT ATTGGAGTCG AGACAATAAC TGGGCATGTT
CCAACAACCT ATATTTTCGC AGGCATGGGG AGATTCGTAG CGGGATGCTC AAGAACGCCT
CTAACAGCTA TGTTTTTAGC TTTCGCATTA ACTAAAAATC TTTTAATACT AAAACCACTC
TTAATTACAT GTATAGCTAG CTTTTTAACT GCCAGAATAT TTAATGAACA CTCAATTTAT
GAAAGACAAA TTACAATTGA AGAAGGAGAA TTAATTTAA
 
Protein sequence
MIKSKSKNPI QKKSTQSIKK LLQRKWLNVI LALILTGLGA ALTGILFKTG IHTLEDYRSN 
LLASMPRWIV LPILGALGGL ISGSLIQNFA PAAKGAGVSH IIAFLRHKPV PMGLRVGIVK
LFAGIIAIGS GFPLGPEGPA VQMGGSVAWK MAKWLKAPIS FRRVIVAAGG GAGIAAIFSA
PIGGFVYAIE ELLNSARPVV LLLVVVTTFW ADSCADILQA IGLDQKAGGF VNNLGFQLER
AYTPVIEFFP IDFLYLIFLG IVLGLLAEMY CRYVLKMQVL GNKWFKNKTI ERMSLSGLLL
GTMYSIIPEN FHNIEGLQKI IVNESSNFTL AISIFFILFF ASGLAAASGA PGGLFYPMLT
LGGAIGLASG IGVETITGHV PTTYIFAGMG RFVAGCSRTP LTAMFLAFAL TKNLLILKPL
LITCIASFLT ARIFNEHSIY ERQITIEEGE LI