Gene NATL1_01481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01481 
SymbolcitT 
ID4780027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp145799 
End bp147616 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content40% 
IMG OID640083412 
ProductDASS family sodium/sulfate transporter 
Protein accessionYP_001013977 
Protein GI124024861 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAGA TATCCACTGT CTTAGATAAT CCCAAAGCTG TCATAACTTT GGCAGTGTTA 
ATTATTGCTG TTTTTTTGTT CGTAAGCAGT GCTCTAGCTC CTGAGCTGAC TGGACTTTTG
AGCGTTGCAT TATTGATGGC TACAGGAGTT CTCTCCCCTC AAAAAGCATT GGCTGGTTTT
GGAAGCCCAG CGTTGATTAC ATTGATGGGT TTATTTGCTG TTTCTGCTGC ACTTTTTAAA
AGTGGTGCCC TTGATCGTTT AAGGGAGTTA ATTGCTTCTG AAAGTATTCG AACGCCGCGC
AGATTGATAG CTTTGCTCGG ATTAGTTGTT GCTCCTGTCT CAGGTGTTGT TCCCAATACC
CCTGTGGTTG CTTCACTTTT ACCAGTAATC GAAGCTTGGT GCGTCAAGCG AAAACTATCT
CCATCACGTG TATTGTTACC ATTATCTTTC GCGACAGTAT TAGGTGGCAC TTTAACTCTT
TTAGGCAGTT CAGTGAATTT GTTGGTCAGC GATATTAGTG ATCAACTTGG TTACGGTCCC
TTTGATTTAT TTACTTTTAC AGCAATAGGA GTACCTATAT GGTTATTTGG GACAGCATAT
ATGTTGCTGG CTCCACAATC TCTATTGCCA GATAGAGGGA GAATTAGCTC AGAGTATGGA
GGAAGTTCGG ATCAAACTGG TTATTTTACC GAAGTCACAA TTCCTTCTGA TTCTGAACTT
GTTGGACATT CATTGAGAAA TAGCCGCTTA CAAAGAAGAT TTGATGTTGA TGTTCTAGAA
ATACAGCGAG AAAATGAAAT ACTTTTACCC CCTTTAGCCG ATCGGATAAT TCAGTCTGGT
GATCGGTTGT TGATAAGAAT TACGCGCTCT GATCTTTTGC GTTTAAAACA AGAACATACT
GTTCAGTTAA ATAAAAACTT AAATATTGAA AAAAATTTTT TTCTGTCCAA CATTGATGAA
AGTCAACAAA CGGTAGAGGT TCTTTTGCCA GCTGGTTCAA CCTTGGCTGG GGCAAGTTTG
CGTGAACTGC GATTTAGACA AAGACACAAT GCTACTGTTT TGGCCTTACG ACGGGGTCAG
CAAACTGTTC AAGAGAGATT GGGACAAGCA ATTTTGAGAG AAGGAGACGT ATTACTCTTA
CAAGCTCCAA GAGATTCAAT TCGTGGATTA CAGGATAGTA ATGACCTTCT TGTTTTAGAT
CAATTTGAGA ATGATCTACC CACTGTTACA AGAAAACCGA TAGCAATTGG CATCGCAATC
GCAATGTTGA TTATTCCCTC AATTACTGAC TTGCCTCTAG TAGCTTCAGT TCTAATGGCA
GTGATTGCTA TGGTCTGGGG AGGATGTTTA AGACCCGCAG AGGTACAAAG ATCAATCCGA
CTTGACGTCA TTCTTTTGCT GGGATCTCTT TCAAGTTTTA GTGTCGCAAT GCAGACCACT
GGTCTTGCTG ATGCTTTTGC CAATATTTTA ATTTTAATAT TGAAAGACTT ATCTACTTAT
TCCGCCTTAC TAGTAATTTT CCTCTCTACT ACAATATTTA CTCAATTTGT TAGTAATGCT
GCTTCAGTGG CTCTTTTGGC GCCAATTGCT GTTCAATTGG CCCCAAGTAT GGGGCTACCT
CCTTTGGCTT TATTGATAAC AGTTCTTTTT GGCGCCAGTC AATCTTTTCT TACTCCTATG
GGTTACCAGA CAAACCTTAT GGTATTTGGC CCTGGGCGTT ATCAGTTTTT AGATGTCACC
AGATACGGAG CTGGGTTGAC CATCTTGATG ACGCTTCTTG TCCCTGGGTT GATATTGCTA
AAGTATGGTA TTTCTTAA
 
Protein sequence
MNEISTVLDN PKAVITLAVL IIAVFLFVSS ALAPELTGLL SVALLMATGV LSPQKALAGF 
GSPALITLMG LFAVSAALFK SGALDRLREL IASESIRTPR RLIALLGLVV APVSGVVPNT
PVVASLLPVI EAWCVKRKLS PSRVLLPLSF ATVLGGTLTL LGSSVNLLVS DISDQLGYGP
FDLFTFTAIG VPIWLFGTAY MLLAPQSLLP DRGRISSEYG GSSDQTGYFT EVTIPSDSEL
VGHSLRNSRL QRRFDVDVLE IQRENEILLP PLADRIIQSG DRLLIRITRS DLLRLKQEHT
VQLNKNLNIE KNFFLSNIDE SQQTVEVLLP AGSTLAGASL RELRFRQRHN ATVLALRRGQ
QTVQERLGQA ILREGDVLLL QAPRDSIRGL QDSNDLLVLD QFENDLPTVT RKPIAIGIAI
AMLIIPSITD LPLVASVLMA VIAMVWGGCL RPAEVQRSIR LDVILLLGSL SSFSVAMQTT
GLADAFANIL ILILKDLSTY SALLVIFLST TIFTQFVSNA ASVALLAPIA VQLAPSMGLP
PLALLITVLF GASQSFLTPM GYQTNLMVFG PGRYQFLDVT RYGAGLTILM TLLVPGLILL
KYGIS