Gene NATL1_21401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21401 
Symbol 
ID4780066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1797685 
End bp1798950 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content35% 
IMG OID640085437 
Productcystathionine beta-lyase family aluminum resistance protein 
Protein accessionYP_001015960 
Protein GI124026845 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4100] Cystathionine beta-lyase family protein involved in aluminum resistance 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAAA ATTCTGCTAA AAATTTTGTT GAAAATATAG AAAAGAAATT ATATCTATCG 
ATTAAAGAAA AGACTGATAG CATAACATTT AAGTTAGAAA AAGTTTTAAA GGCTTTTTCT
GATTCTCATC TAAATGTACA ACATTTTGCA TCTTTAACAG GTTATGGACA TGGAGATATA
GGGAGAGACA TAATTGATAA AATTTTTGCA AATGTTTTAG ATGCTGAAAA AGCAGCGGTA
AGGCTTCAGC TTGTCAGCGG TACACATGCT ATTACATCTT CTTTGTTTGG AGTCTTAAGG
CCAGGTGATG GCTTATTATC TGTTGCTGGA AGGCCTTATG AATCACTTGA AGAAGTGATT
GGATTGCGGG GAAATGGTCA AGGATCTTTA ATTGAGTTTG GTATTACATA TGAAGAAATA
TCTTTAAAAA ATGATGGAAA TATAGACTTT TTAGCTTTGG AGAAAGCTTT AAATATACCT
AGAAAATTAA TTTTTATACA GCGTAGTTGC GGTTATACGT GGCGTCCATC TTTGAGTATT
GAGGTTATTA AAGAGATTTG TTATTTATGT CATAAAATTC AGCCTAATTG TATTTGCTTT
GTTGATAATT GTTATGGAGA ATTTGTCGAA ACGAATGAGC CAACGTCAGT AGGTGCAGAT
TTAATCGCAG GTTCTTTAAT AAAAAATTTA GGTGGAACTA TTGTTCCTAC TGGTGGATAT
ATTGCGGGGA AAGCTGATTT AGTTGATAAA GCATGTTGTC GCTTGACTGC TCCAGGTATT
GGATCTGAGG GCGGTATTAC TTTTGACTTA AATAGAACTA TTTTACAAGG ACTTTTTCTG
GCGCCTCAAA TGGTTTCAGA GGCACTAATA GGGGCTGAAA TTATTTCCAC TTCATTTAGC
GAATTGGGCT TTAAAGTATT GCCAACTCCT GCTTCTAATC GCACTGATTT GATCCAAATA
GTTAGAATCG GCGATCCAAA AATCTTACAA ATTATTTGTA GGTCTTTTCA AGAAAAATCA
CCTATTGGAT CTTTTCTCGA TCCAATACCT GCTCCAATGC CAGGGTATGA AAATAATTTA
GTCATGGCTG GAGGAACTTT TGTAGATGGT AGTACGAGTG AATTTTCTGC TGATGCTCCT
ATGAAACCTC CATTTGATCT ATTTATTCAA GGAGGATCTC ATCGCGCTCA TGTCAAAATT
GCTTTGATTC ATGCATTATC CAATTTATTT CAGGCAGGAT TGATCAAATT ACCCCAGAAT
GATTAA
 
Protein sequence
MLKNSAKNFV ENIEKKLYLS IKEKTDSITF KLEKVLKAFS DSHLNVQHFA SLTGYGHGDI 
GRDIIDKIFA NVLDAEKAAV RLQLVSGTHA ITSSLFGVLR PGDGLLSVAG RPYESLEEVI
GLRGNGQGSL IEFGITYEEI SLKNDGNIDF LALEKALNIP RKLIFIQRSC GYTWRPSLSI
EVIKEICYLC HKIQPNCICF VDNCYGEFVE TNEPTSVGAD LIAGSLIKNL GGTIVPTGGY
IAGKADLVDK ACCRLTAPGI GSEGGITFDL NRTILQGLFL APQMVSEALI GAEIISTSFS
ELGFKVLPTP ASNRTDLIQI VRIGDPKILQ IICRSFQEKS PIGSFLDPIP APMPGYENNL
VMAGGTFVDG STSEFSADAP MKPPFDLFIQ GGSHRAHVKI ALIHALSNLF QAGLIKLPQN
D