Gene NATL1_21631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21631 
SymbolpyrG 
ID4779795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1820216 
End bp1821883 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content39% 
IMG OID640085461 
ProductCTP synthetase 
Protein accessionYP_001015983 
Protein GI124026868 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.397558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAT TTGTTTTCGT CACTGGTGGT GTAGTTTCAA GCATTGGCAA AGGAATTGTT 
GCCGCAAGTC TTGGCAGACT ACTTAAATCA AAAGGGTACA GTGTTTCGAT TTTGAAGCTT
GACCCATATT TAAATGTTGA TCCTGGGACG ATGAGCCCAT TTCAACATGG AGAAGTTTTT
GTAACTGAAG ACGGGGCTGA AACTGATTTG GACCTTGGGC ATTATGAGCG CTTTACAGAC
ACTGCAATGT CAAGACTAAA CAGTGTCACG ACAGGTTCCA TTTATCAAGC TGTTATTAAT
AAAGAAAGAA GAGGTGACTA TGACGGAAGG ACGGTTCAAG TAATCCCCCA CATCACTCGT
GAAATTCGCG AAAGAATTAA ACGTGTAGCA AACAATAGTG GTGCAGATGT AGTGATATCA
GAAATTGGAG GAACAGTTGG AGATATTGAA TCTCTACCTT TTCTTGAAGC AATAAGAGAA
TTTAAAGGAG ATGTAAAAAG AAACGATGTT GTTTATGTTC ACGTCACATT ATTGCCTTAC
ATAGGTACAT CTGGAGAAAT TAAAACCAAG CCAACACAGC ACTCAGTAAA AGAATTGCGT
TCTATAGGTA TTCAGCCAGA TATTTTAGTT TGTCGCAGTG ATAGGCCTAT TAATGATGAA
TTAAAAAATA AAATAGGGGG TTTCTGTGGA GTTAACTCTG AAGCGGTCAT AGCATCCTTA
GATGCTGACA GTATTTACTC AGTACCATTA GCGCTCAAAG ATGAGGGACT TTGCAAGGAA
GTATTAGATT GCTTGGATCT AAATGATCAT GAAAGTGATT TAAAAGACTG GGAACGATTA
GTCCATAAAT TGCGCAATCC AGGTCCTTCT GTCAAAGTTG CATTAGTTGG CAAATATGTA
CAATTAAATG ATGCCTACTT ATCAGTAGTC GAAGCATTAC GTCACGCATG TATCTCGCAT
GATGCGTCTT TAGATCTTCA CTGGATCAAT GCGGAAAACA TAGAATCCGA GGGAGCTGAA
AAACTACTGC AGGGAATGGA CGCAATTGTC GTTCCAGGAG GTTTTGGTAA CCGTGGCGTC
AATGGGAAAA TAGCTGCTAT TAGATGGGCA AGGGAGCAAA GAGTACCTTT TTTAGGGCTT
TGCTTAGGTA TGCAATGTGC TGTCATTGAA TGGGCTCGCA ATATCGCTGG TTTAGAAGAT
GCATCTAGTG CCGAACTAAA TCCGAATTCA AAACACCCAG TAATTCACTT ACTTCCAGAA
CAACAAGACG TCGTTGATCT AGGAGGAACA ATGAGATTAG GCGTATATCC TTGTAGACTT
CAAGCAAACA CAACAGGACA AAGTTTATAC AACGAAGAAG TTGTTTACGA AAGGCATAGA
CATCGTTATG AATTCAATAA TTCATACAGA ACTCTACTAA TGGAATCTGG GTATGTAATT
AGTGGTACTT CACCTGATGG TCGATTAGTA GAGTTAATAG AATTAAAAAA TCACCCATTT
TTTATTGCAT GTCAATATCA TCCAGAATTC CTCTCTAGAC CAGGAAAACC ACACCCTTTA
TTTGGTGGTT TGATTCAAGC CGCACAAATA CGTGTACCAT CCTCACCGAG TGAGGCATTT
AATCCTCAAT CAAAAATTAT TGAAAAAAAA TCACTAGAAC AGCAATAG
 
Protein sequence
MAKFVFVTGG VVSSIGKGIV AASLGRLLKS KGYSVSILKL DPYLNVDPGT MSPFQHGEVF 
VTEDGAETDL DLGHYERFTD TAMSRLNSVT TGSIYQAVIN KERRGDYDGR TVQVIPHITR
EIRERIKRVA NNSGADVVIS EIGGTVGDIE SLPFLEAIRE FKGDVKRNDV VYVHVTLLPY
IGTSGEIKTK PTQHSVKELR SIGIQPDILV CRSDRPINDE LKNKIGGFCG VNSEAVIASL
DADSIYSVPL ALKDEGLCKE VLDCLDLNDH ESDLKDWERL VHKLRNPGPS VKVALVGKYV
QLNDAYLSVV EALRHACISH DASLDLHWIN AENIESEGAE KLLQGMDAIV VPGGFGNRGV
NGKIAAIRWA REQRVPFLGL CLGMQCAVIE WARNIAGLED ASSAELNPNS KHPVIHLLPE
QQDVVDLGGT MRLGVYPCRL QANTTGQSLY NEEVVYERHR HRYEFNNSYR TLLMESGYVI
SGTSPDGRLV ELIELKNHPF FIACQYHPEF LSRPGKPHPL FGGLIQAAQI RVPSSPSEAF
NPQSKIIEKK SLEQQ