Gene NATL1_19051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_19051 
SymbollraI 
ID4779858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1567144 
End bp1568673 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content38% 
IMG OID640085195 
ProductABC transporter substrate-binding protein 
Protein accessionYP_001015725 
Protein GI124026610 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATT TTTCAAATCA GTCCAAAAAG GTTAAACCCA TAATCAATAA AACAGTTCTC 
AAAAGTTCTC TTGTTGCTGG AGCATTTTTA TTTTCTGGAA TCAATCAGAC AGCGCAAGCA
AATACAAAAT CAATTGTTGC TGTAGAGCCA TTGGTTTGCG ATGTTGTATC TGCTATTGCA
CCACCCTCTA CGCCCGTAAC CTGCTTAATT GACAGAAAGC AAGATGTTCA TGATATCAAG
ATCACTCCAA GGCAAGCTCA AACACTAAAA AGTGCGAATC AAGTATTTAC TCTTGGTTCA
GAGATGACCC CTGCAATTAA AAAATGGTTG GATAATCCCT TAACTGTTGT CGTTGGTGTA
AGTGCAATAG AAATAGACGA TCATGACGAC CACGATGATC ATGACGATCA TTCAGCTGCT
AAGCATGATG ATCATGACGA CCACGACGAT CATTCAGCTG CTAAGCATGA TGATCATGAC
GATCACGATG ATCATGGCGA TGCCCATGGA GAGGGAGCTT TTGAATGGGC TGGTGTTTTT
GATCTTTCCA CAGGAGTCTA CAAATGGTCT TTCGCCAAAG TTGATGGAGA CTATGCTGAT
CCTGCGATGA AAATGGTTAT TCTTAAGTCT GGTGATATTG AAGCATCAGA AGAGCTTGCT
AAAGAATTAT TAGGATCCAA AAATTCAGAA GTTAAGCGCA ATAATGACAA ACTTATTGCG
CAGGACAAAG CCTTCCTTCT TACATTTAAT GAAAAGAAAG ACATCACAAC ATTTACTGTA
GAAATCAAAA AATCTGGTAA ATACGCTTTC TTTACTGAGC ATATGCCGTT TGAGTTTGAA
GCCGATGAAC ATTTCTTTAA AGATGTTTCA GGCGACGATG TTGAACCGAT TGCCCAAGTA
CCAGATGAAG GAGATCATCA TCACCATGAC CATGGAGGCT TAGATCCTCA TATCTGGCAT
GATCCACATA ACATCATCAA GATGGGAAAT GTAATTTCTA AAAATATCAA CAAGAAGATT
TCATTTTTTG ATAGAGAGAC TAAAAAAGTT TTAAAAGAAA GAACTCAATC TGTAAATTCC
ATTTTGGAAG ATCTAGATCA ATGGACTCAA GAACAAATAG CTACTATTCC TTCTGATCAA
AGGACGATGG TTTCTAAGCA CAAAGCCATG GAATATTACG GAGATGCATT TGGATTGAAG
ACCATGAGCC TACTAGATTT TCTTGGTGAT TCATCCAGCC TTAGGCCTCA AACTATTTCA
ACTGTATTAG CTGAGCTTAA AGAAGAAAAC GTGAAAGTTT TATTCGCTGA GCAAAAGCCT
CCTTCAAAGC TATTGAGGAA CCTCAGTAGA CAAACTTCCA CTCCTATCGC ATCAAATCAA
ATCTATGTTG ACGGTCTAAT GCCAACAGGG AATACTGTTT CAGTTGCTGT ACATAACACC
TGCACAATTG TTAATTCACT TGGTGGAGAA TGTGATGAGC AAGAGGGCGA TGAACTTGAG
GGGAAATGGA ATTCTTTAAT TAATCCTTAA
 
Protein sequence
MLNFSNQSKK VKPIINKTVL KSSLVAGAFL FSGINQTAQA NTKSIVAVEP LVCDVVSAIA 
PPSTPVTCLI DRKQDVHDIK ITPRQAQTLK SANQVFTLGS EMTPAIKKWL DNPLTVVVGV
SAIEIDDHDD HDDHDDHSAA KHDDHDDHDD HSAAKHDDHD DHDDHGDAHG EGAFEWAGVF
DLSTGVYKWS FAKVDGDYAD PAMKMVILKS GDIEASEELA KELLGSKNSE VKRNNDKLIA
QDKAFLLTFN EKKDITTFTV EIKKSGKYAF FTEHMPFEFE ADEHFFKDVS GDDVEPIAQV
PDEGDHHHHD HGGLDPHIWH DPHNIIKMGN VISKNINKKI SFFDRETKKV LKERTQSVNS
ILEDLDQWTQ EQIATIPSDQ RTMVSKHKAM EYYGDAFGLK TMSLLDFLGD SSSLRPQTIS
TVLAELKEEN VKVLFAEQKP PSKLLRNLSR QTSTPIASNQ IYVDGLMPTG NTVSVAVHNT
CTIVNSLGGE CDEQEGDELE GKWNSLINP