Gene NATL1_02341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02341 
SymbolgltA 
ID4779542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp216876 
End bp218003 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content41% 
IMG OID640083499 
Productcitrate synthase 
Protein accessionYP_001014063 
Protein GI124024947 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTTCTAA AGCCAGGCTT GGAAGGAGTT CCAGTAACCA AATCAGGGAT ATGCGAAATA 
AATGGAACAG AAGGAAGGCT TAGCTACAGA GGTTATCCAA TATCTGAGCT AGCCCAAAAG
AGTAGTTTTT TAGAAACTGC ATTTCTCCTG ATCTGGGGAG AACTTCCCAC TGAAAATGAG
CTTGAGAAAT TTGAAAAGGA CGTTCAAATG CATAGACGAG TGAGCTTTAG AATTAGAGAT
ATGCTCAAGT GTTTTCCAGA GTCTGGGCAT CCTATGGATG CTCTTCAAGC AAGTGCTGCA
TCTTTGGGCC TCTTTTATTC TCGAAGAGCA ATTGATGATC CAAAATATAT CTACGACGCA
GTAGTGAGGT TGATTGCAAA AATTCCAACT ATGGTTGCTG CTTTCGAGCA AATAAGAAAA
GGAGACGATC CAATTCAACC GCAAGATGAT TTACCTTACT CTTCCAATTT CCTTTATATG
CTCACCGAGA GGGAGCCAAA TCCTCTTGCA GCAAGAGTTT TCGACAGATG TTTAATTCTT
CATGCCGAGC ACAGTCTCAA TGCAAGTACG TTTAGCGCAA GAGTAACTGC AAGCACATTG
ACTGATCCTT ATGCCGTCGT CGCTTCTGCC GTTGGGACAT TAGCCGGTCC TCTTCATGGA
GGGGCCAATG AAGATGTGAT AGCAATGCTA GAAGAAATTG GAAGGCCTGA TGAAGCCTCT
TCATTTCTCA ATGATGCAAT TGCAAAAAAA AGGAAAATCA TGGGCTTTGG ACACAGGGAA
TATCGTGTCA AAGACCCTAG AGCAACAATT TTACAAGCCT TCGCAGAGGA ACTTTTCTCG
GAATTTGGTA AAGATGAAAT GTATGAAGTA GCCAAAGCAC TTGAAGAAGA AGCTATTTCC
AAGCTGGGGC CAAAAGGTAT ATTCCCAAAT GTTGACTTTT ATTCCGGACT TGTTTATCGA
AAGCTAGGTA TTCCTCGAGA TTTATTTACA CCAGTTTTTG CTATTTCCAG AGTTGCTGGT
TGGTTGGCTC ACTGGAGAGA ACAACTTGGA GCAAATAGAA TTTTCAGACC ATCACAAATT
TATGAAGGAG CAAAAATGAG AAATTGGAAG CCTCTTGAAA GCAGATAA
 
Protein sequence
MVLKPGLEGV PVTKSGICEI NGTEGRLSYR GYPISELAQK SSFLETAFLL IWGELPTENE 
LEKFEKDVQM HRRVSFRIRD MLKCFPESGH PMDALQASAA SLGLFYSRRA IDDPKYIYDA
VVRLIAKIPT MVAAFEQIRK GDDPIQPQDD LPYSSNFLYM LTEREPNPLA ARVFDRCLIL
HAEHSLNAST FSARVTASTL TDPYAVVASA VGTLAGPLHG GANEDVIAML EEIGRPDEAS
SFLNDAIAKK RKIMGFGHRE YRVKDPRATI LQAFAEELFS EFGKDEMYEV AKALEEEAIS
KLGPKGIFPN VDFYSGLVYR KLGIPRDLFT PVFAISRVAG WLAHWREQLG ANRIFRPSQI
YEGAKMRNWK PLESR