Gene NATL1_10941 details

Gene Information

Locus tag: NATL1_10941
Symbol:
ID: 4779316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 999598
End bp: 1000638
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 35%
IMG OID: 640084373
Product: hypothetical protein
Protein accession: YP_001014917
Protein GI: 124025801
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0569] K+ transport systems, NAD-binding component
TIGRFAM ID:
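
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both are assumptions, although they agree with the listed values. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 999598, End bp 1000638.
length = gene_length_bp(999598, 1000638)
print(length, protein_length_aa(length))  # 1041 346
```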


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00275534
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAATCTGA TAAAAACAAA TAAATTTAGA GGCTATTTAA AAGTTTGGGC AGCGCCAATA 
TCTTTACTCA TATTCTTATT TTTATTCGGT GCTTTAGGCT ATCGATTTAC AGAAGGTTGG
GATTGGGGAG ACTGTTTATG GATGGTACTG ATAACCATTA CAACAATAGG CTTTGGTGAA
GTTGAAGTTT TGAGTTCGGC TGGGCGGGTT ATAACTTTTT TAATCATCGG AGGGGGATTA
TTTGTAGTTC AATTAACTCT TCAAAGATTT ATACAATTAT CTGAACTAGG ATATTTCATA
AAATTAGAGG AACTCCGATT AAGGAGATTA ATTAGAAGAA TGAAAAATCA TGTAATTATA
TGTGGATATG GTCGTACAGG TAAAGAAATT GCTGACCAAT TATATTCTGA AAAAATATCT
ACACTAATAA TAGAAAACGA CCCTACAAGA AAAACCGAAG CTGAGGAAAA AGGATTTAAC
GTCTTATTAG CAGATGCAAC AATGGACGAA ACATTATTAC TAGCAGGAGT AAGAAATTGT
CGTAGCTTAG TGGTTACTCT TCCAAATGAT GCAGCGAATT TATATGTTGT TCTTAGTGCA
AAAGCTCTAA ATAATAGTTG CAGATTGATC GCTAGGGCAG CGAACGAGGA AGCTGCTAGT
AAATTAAAAC TTGCAGGAGC TGATGCGGTA GTCAGTCCAT ATGTTGCGGC AGGGAGAACA
ATGGCAGCCT CTGCATTAAG ACCTATAGCT GTGGACTTTA TAGATTTACT TGCAGGTTCA
GATTGCGAAA TAGAAGAATT TAAGCTTACT GAACATGTTG AAACAATTGA GACTTTCAGA
AGTCAACATG AATATGTTTT TGAAATTTCA AAACGAGGCG AAGCACTACT TTTAGCAACT
AAAGTCTCGG GTCAATTAAT AGGTAATCCT AAAAACAAGG TATCTATCTC TCCAGGCATG
ATTTTGATAT TCCTTGGAAG TCAAGAGCAA TTAAACAGGA TTAGAGTTCA CCTTAAAGAA
GTTTTAGTAA AAACAACATA G
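
The GC content reported in the gene information can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name, and how the record rounds the value to a whole percent is not stated.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G or C bases, ignoring the spaces and line breaks
    used to lay the sequence out in 10-base blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First block of the gene sequence: 3 of its 10 bases are G or C.
print(gc_fraction("TTGAATCTGA"))
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value to compare against the reported percentage.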
 
Protein sequence
MNLIKTNKFR GYLKVWAAPI SLLIFLFLFG ALGYRFTEGW DWGDCLWMVL ITITTIGFGE 
VEVLSSAGRV ITFLIIGGGL FVVQLTLQRF IQLSELGYFI KLEELRLRRL IRRMKNHVII
CGYGRTGKEI ADQLYSEKIS TLIIENDPTR KTEAEEKGFN VLLADATMDE TLLLAGVRNC
RSLVVTLPND AANLYVVLSA KALNNSCRLI ARAANEEAAS KLKLAGADAV VSPYVAAGRT
MAASALRPIA VDFIDLLAGS DCEIEEFKLT EHVETIETFR SQHEYVFEIS KRGEALLLAT
KVSGQLIGNP KNKVSISPGM ILIFLGSQEQ LNRIRVHLKE VLVKTT
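
The protein sequence begins with M although the gene sequence begins with TTG, which normally codes for leucine. This is consistent with translation table 11, where TTG can act as an initiation codon. The sketch below illustrates the relationship; it assumes the amino-acid assignments and initiation codons of NCBI translation table 11 as listed in the code, and `translate_cds` is a hypothetical helper rather than the tool that produced this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the
# standard code; table 11 differs in its set of initiation codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; an initiation codon in first position
    is read as M, and a trailing stop codon is dropped."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("TTGAATCTGA TAAAAACAAA TAAATTTAGA"))  # MNLIKTNKFR
```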