Gene NATL1_02681 details

Gene Information

Locus tag: NATL1_02681
Symbol:
ID: 4779472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 247047
End bp: 248651
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 34%
IMG OID: 640083533
Product: ABC transporter ATP-binding protein
Protein accession: YP_001014097
Protein GI: 124024981
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
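
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(247047, 248651)
print(length_bp)                  # 1605
print(protein_length(length_bp))  # 534
```

Both values agree with the Gene Length and Protein Length fields above.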


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATT GCTCAAAAGA AGTTTTAAAG ATAAATAAAC TTAATGCTAT TTATCCAAAT 
AGTACTACTT ACGTAATAAA TGGATTAAAT CTGAAGATGA ATCGTGGGGA TAGGCTTGCA
TTGGTTGGTA GTTCAGGTTG TGGCAAAAGT ACTGTTGCTA AGGCCATAAT GCAGCTTCTT
CCAGACGGAA GTACTTGTTA TGGTGAGATT TTTTTAAATG GAAAAAATGT ATTAAAATTA
GACGAGGATT CTTTGCCAAC TATTCGAGGA AAAGAGGTTG GATTGATATT TCAGGATCCA
ATGTCACGGT TGAATCCATT AATGACTGTT GGTGATCATA TAGTTGATAC TTTTAAAGCT
CATGATAATT CTGAACCAAT TTATAACTTA GTAAAAAAAG CTAAAAGCTT ATTAGAAAAA
GTTGGAATAG ATCCTTTAAG GTTTAATTCT TTTCCACATG AATTTAGTGG CGGAATGCGA
CAACGCGTAG CTATTGCTTT GGCAATTTGT TTAAGACCTC CTTTGATAAT TGCTGATGAA
CCTACTACTA GCTTAGATAC AATAGTTGCT GATCAAATTA TGAGTGAATT GAGTTTACTT
TGTGATGAGA TTGGGACTGC TTTACTATTA ATTAGTCATG ATTTATCTAT GGCATATAAA
TGGTGTAATA AAATTGCGAT ACTTGATTGT GGGAAAATAG TAGAATCTGG AAATATTAAA
CAGATAATTG GTGATCCTAA AACTAATATT GCTCAAAAAT TAGTGGAGTC AGGAAGGTTA
TTAGAGGGTT CTGAGAGAAA GTTAATTAAT AAAAGTACTG TACTATTGAG CGTAAATAGA
TTACGTTGCT GGCATGATTT AGGCTTCTGG CCCTTCAATT CTTTTTGGTT GAAAGCTGTT
AATGAAGTTA CCTTTTCTCT GTATGAAGGT GAAACTCTTG GTATCGTAGG CCCATCAGGA
TGTGGAAAGA GTACTCTCTG TAGAGCGTTG ACTGGTTTAT TACCTACAAG AGGTGGAAGT
GTTCTTTTTC TTGGAAAGAA TATTTCAATT ATCAATAGGA AATCTTTAAA ACAATTACGT
AAATATATTC AAATTATTTT TCAAGATCCT TCTGCTTCTT TGAACCCTAA GATGTCAGTA
CTGGATGCAA TTATAGATCC AATTCTTATT CATAAATTGT TAAGTCGATC TCAAGCCAGA
GAAAAAGCTC GTAATCTTTT AGACCTAGTT GGTTTAGTGC CCACAGCAAT GTATGAACAA
CGCTTACCTT CTCAACTTTC AGGAGGACAA CAGCAAAGAG TAGCTATTGC AAGAGCACTT
GCACTTAGTC CCAAAATACT TATCTGTGAT GAAAGTGTCA GTATGTTAGA TGCAGAAATT
CAAGCAGAAG TCTTAGAGTT GCTACGCTCT TTGCAAGAAA AATTGAAACT GTCTATGTTG
TTCATAACTC ATGATTTATC TGTTGCAGCA GGCTTTTGTC ACAGAGTATT AGTGTTAGAT
AAAGGGAAAA TTATCGAAGA AAATTTTGGT AAAAATTTGT TAAATGATCC TCAAAAATAC
TTAACAAAAA AGATGGTTAA GGCATGTCCT AGACTTCCGA ATTAA
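
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helper below (a hypothetical name, not a database function) strips the whitespace and reports the percentage of G and C bases. It is applied here only to the first line of the sequence, which is not representative of the whole gene; the GC content field in the record refers to the full 1605 bp.

```python
def gc_percent(dna):
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record (60 bases).
first_line = "ATGAATAATT GCTCAAAAGA AGTTTTAAAG ATAAATAAAC TTAATGCTAT TTATCCAAAT"
print(gc_percent(first_line))  # 20.0
```

Passing the complete gene sequence to the same function gives the figure to compare with the record's GC content field.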
 
Protein sequence
MNNCSKEVLK INKLNAIYPN STTYVINGLN LKMNRGDRLA LVGSSGCGKS TVAKAIMQLL 
PDGSTCYGEI FLNGKNVLKL DEDSLPTIRG KEVGLIFQDP MSRLNPLMTV GDHIVDTFKA
HDNSEPIYNL VKKAKSLLEK VGIDPLRFNS FPHEFSGGMR QRVAIALAIC LRPPLIIADE
PTTSLDTIVA DQIMSELSLL CDEIGTALLL ISHDLSMAYK WCNKIAILDC GKIVESGNIK
QIIGDPKTNI AQKLVESGRL LEGSERKLIN KSTVLLSVNR LRCWHDLGFW PFNSFWLKAV
NEVTFSLYEG ETLGIVGPSG CGKSTLCRAL TGLLPTRGGS VLFLGKNISI INRKSLKQLR
KYIQIIFQDP SASLNPKMSV LDAIIDPILI HKLLSRSQAR EKARNLLDLV GLVPTAMYEQ
RLPSQLSGGQ QQRVAIARAL ALSPKILICD ESVSMLDAEI QAEVLELLRS LQEKLKLSML
FITHDLSVAA GFCHRVLVLD KGKIIEENFG KNLLNDPQKY LTKKMVKACP RLPN
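
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual compact NCBI layout (bases ordered T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in T/C/A/G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGAATAATT GCTCAAAAGA AGTTTTAAAG"))  # MNNCSKEVLK
print(translate("CCTAGACTTCCGAATTAA"))                # PRLPN
```

The two outputs match the first block (MNNCSKEVLK) and the final residues (RLPN) of the protein sequence above.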