Gene Tery_2872 details


Gene Information

Locus tag: Tery_2872
Symbol: pyrG
ID: 4244943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4474215
End bp: 4475930
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 39%
IMG OID: 638107921
Product: CTP synthetase
Protein accession: YP_722518
Protein GI: 113476457
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0504] CTP synthase (UTP-ammonia lyase)
TIGRFAM ID: [TIGR00337] CTP synthase
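
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are inclusive 1-based coordinates (the record does not state its convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4474215, 4475930)
print(length, protein_length_aa(length))  # prints: 1716 571
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.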


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0537097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAAAT TTGTATTCAT AACTGGTGGA GTAGTTTCCA GTATTGGTAA AGGAATTGTC 
GCAGCAAGCT TAGGCCGTTT GCTCAAATCA CGGGGTTATT CTGTTTCTAT TCTTAAACTA
GACCCCTATA TAAATGTAGA CCCAGGGACA ATGAGTCCCT TCCAACATGG AGAAGTATTC
GTTACAGAAG ATGGGGCAGA AACTGACCTA GACCTAGGCC ACTATGAACG ATTCACAGAT
ACCTCAATGT CTCGCCTTAA TAGTGTTACC CAAGGTTCGA TCTACCAGGC AGTAATTAAT
AAAGAAAGGC GAGGGGACTA TCAGGGAGGA ACTGTTCAGG TTATTCCTCA TATTACCAAT
GAAATCAAAG AAAGAATACA TAGAGTAGCC AAAAATGCTA ATCCAGATGT GGTAATTACA
GAAATAGGGG GAACTGTAGG AGATATCGAA TCTCAGCCAT TTTTAGAAGC AATTCGGCAG
TTTCGCAAAG ATGTAAGACG GAATAATGCT TTGTATATCC ATGTGACATT AGTGCCTTGG
ATTGCTTCTG CAGGAGAAAT GAAAACTAAG CCGACTCAAC ACTCAGTCAA GGAGTTACGT
TCAATTGGTA TCCAGCCAGA TATTTTGGTC TGTCGTTGCG ATCGCAAACT GAGTGAAGGC
CTGAAAGAAA AAATGTCGGA GTTCTGTGAT GTACCTGTAG AATCTGTAAT TACGTCTCAA
GATGCTCAAA GTATTTATGA AGTACCTTTG ATGTTAGAAA CAGAAGGGCT AGCAGTACAA
GCTCTAGAAT TATTAAAAAT GGAGCAACGT CAACCAAACC TTAGTCATTG GAAAACTTTG
GTCAATCGTC TTTATCATAG AGATAAGCAA GATATACGGA CTAGCCAAGG TAAATTATCT
ATTACTGCTA CTGTAGAAAT TGCCATAGTA GGTAAATATA TTCAACTAAG TGATGCTTAC
CTGTCAGTGG TAGAAGCATT ACGTCATGCA GCGATCGCTG TTGGTGTAGA TTTAAATCTG
CATTTGGTAA ATGCTGAAGA TGTGGAGACC AAGGGAGCTA GGACCTACCT GGAAAAAGCT
AATGGAATTA TTGTTCCGGG TGGGTTTGGC GTGAGAGGTA TTGATGGGAA AATAGCTACA
GTTGAATATG CCAGAATAAA TAAAATTCCT TTTTTGGGAT TATGTCTAGG AATGCAATGT
GCAGTAATTG AGTGGGCACG CAATATAGCA AAATTAGATG CTGCCCATAG TTTTGAGTTT
GACCCTCAAA CACCAAATCC GGTGATTAAT TTATTACCAG AACAACAAGA TGTAGTAGAC
TTGGGCGGCA CTATGCGACT TGGACTTTAT CCATGCCGTC TTCAAGGGGA TACATTGGCA
TTTAAAACTT ATCAGCAAGA AGTAATTTAT GAACGCCATC GTCATCGATA TGAATTTAAT
AATGCTTACC GAAACTTGTT CAAAGAAACA GGATATATTA TCAGTGGAAC TTCTCCGGAT
GGTAGACTAG TAGAAATTAT TGAACTTCCT GCTCATCCTT TTTTTATTGC TACTCAATTT
CATCCAGAGT TTCAATCTCG ACCTAGTACT CCCCATCCTC TATTTCATAA TTTTTTTCAA
GCAGCTAATT CACAAGAAAA ATGTGATAAT TTATCTGTGA ATAATACTGA AGAAACCCCA
GACAATTTAG TTAGTAAAAG TTCTATTCCT AATTGA
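
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line, so the whitespace has to be removed before any computation. The sketch below, in Python, shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGACTAAAT TTGTATTCAT AACTGGTGGA GTAGTTTCCA GTATTGGTAA AGGAATTGTC"
seq = clean_sequence(first_line)
print(len(seq))                   # prints: 60
print(f"{gc_fraction(seq):.1%}")  # GC fraction of this excerpt only
```

Applying the same two functions to all lines of the gene sequence gives a whole-gene value to compare with the GC content field.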
 
Protein sequence
MTKFVFITGG VVSSIGKGIV AASLGRLLKS RGYSVSILKL DPYINVDPGT MSPFQHGEVF 
VTEDGAETDL DLGHYERFTD TSMSRLNSVT QGSIYQAVIN KERRGDYQGG TVQVIPHITN
EIKERIHRVA KNANPDVVIT EIGGTVGDIE SQPFLEAIRQ FRKDVRRNNA LYIHVTLVPW
IASAGEMKTK PTQHSVKELR SIGIQPDILV CRCDRKLSEG LKEKMSEFCD VPVESVITSQ
DAQSIYEVPL MLETEGLAVQ ALELLKMEQR QPNLSHWKTL VNRLYHRDKQ DIRTSQGKLS
ITATVEIAIV GKYIQLSDAY LSVVEALRHA AIAVGVDLNL HLVNAEDVET KGARTYLEKA
NGIIVPGGFG VRGIDGKIAT VEYARINKIP FLGLCLGMQC AVIEWARNIA KLDAAHSFEF
DPQTPNPVIN LLPEQQDVVD LGGTMRLGLY PCRLQGDTLA FKTYQQEVIY ERHRHRYEFN
NAYRNLFKET GYIISGTSPD GRLVEIIELP AHPFFIATQF HPEFQSRPST PHPLFHNFFQ
AANSQEKCDN LSVNNTEETP DNLVSKSSIP N
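
The protein sequence follows from the gene sequence by codon-wise translation. The following is a minimal Python sketch, assuming the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in their permitted start codons). The `translate` helper is hypothetical and does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; first base varies slowest),
# paired with the amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACTAAAT TTGTATTCAT AACTGGTGGA"))  # prints: MTKFVFITGG
# Last two codons of the gene sequence: one residue, then the stop codon.
print(translate("AATTGA"))  # prints: N
```

The first call reproduces the first ten residues of the protein sequence above, and the second reproduces its final residue.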