Gene Tery_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1471 
Symbol 
ID4241677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2231265 
End bp2232536 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content43% 
IMG OID638106624 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_721234 
Protein GI113475173 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0600936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATAT TAATAGTAGG TAACGGTGGT AGAGAACACG CGATCGCCTG GAAACTATCC 
CACTCTTCCC AAGTTCAACA AATATTCTGT ACCCCTGGCA ACGGCGGCAC AGCTACCCAG
GCAAAATGTC AAAACATAGT CATCCCAGTG ACAGACTTTC CAGCTCTCAG ACAACTCATC
GCTGACGAAA ACATTTCTCT AGTGGTTGTT GGCCCTGAAG TTCCTTTAGC CCTCGGCATT
ACCGACTACC TACAAAAATC CCATCTCAAA ATATTCGGCC CCACTAAAGC AGGGGCTATT
CTTGAAGCGA GCAAATCTTG GGCTAAACAA TTCATGGCCG AAGCAAATAT TCCCACTGCT
AAGTCAGCCA CCTTTACCAG GGCTGAAGCT GCTAAAGAAT ACATCACAGA AATTCCCATA
GTCATTAAAG CAGATGGTCT AGCGGCTGGA AAAGGAGTTA CGGTAGCCAC TACAATTGAA
ATGGCTCACA ATGCTATAGA TAGGATCTTT CAAGGAGAAT TTGACAAAAA AAACCAATCT
GTAGTTATCG AAGAGTTCCT CACAGGTCAA GAAATATCAG TATTAGCCCT AACCGATGGA
CTAACCATTC GTCCCCTATT ACCAGCCCAA GACCATAAAG CAGTAGGAGA AGGAGACACA
GGACCTAACA CTGGAGGTAT GGGTGCTTAT GCCCCTACTC CTATTGTTAC CCCCGAACTC
ATGACCCACA TTCAACAACA AGTCCTAGAA CCAACTTTGA CTAATTTACA AAAACGTGGT
ATAGATTATC GAGGTGTGAT TTATGCTGGA TTAATAATAA GTCCCACAGG AGAGGCAAAA
GTATTGGAAT TTAACTGTAG GTTTGGCGAC CCAGAAACTC AACCTATTCT GTCATTGCTG
GAAACTCCAT TGGAGGAATT ATTATTAGCT TGTTGTGAAC AACGTTTGGG TCAGCTGCCC
CCTATTAATT GGAAATCAGG AGTAGCAGTA TGTGTAGTTT TAGCTTCTGG GGGTTATCCA
GGAACTTATC AGAAAGGAAA AATCATATGT GGAGTAGAAA AGGCTGAAGC CACTGGAGTA
AAAATATTTC ACGCAGGAAC AAAATTCCAG GCACAAAATT TAATCACTGA GGGTGGTCGT
GTCTTGGGAG TAACAGCAGT GGGGGCTAAT TTTACTGAAG CCAAGGCAAA AGCTTATGCT
GCTGTAGATT GTATTCAATT TGAAAGAATG TACTATCGCA GAGATATAGG TCGTCAGGTT
TTTAGCAATT AG
 
Protein sequence
MKILIVGNGG REHAIAWKLS HSSQVQQIFC TPGNGGTATQ AKCQNIVIPV TDFPALRQLI 
ADENISLVVV GPEVPLALGI TDYLQKSHLK IFGPTKAGAI LEASKSWAKQ FMAEANIPTA
KSATFTRAEA AKEYITEIPI VIKADGLAAG KGVTVATTIE MAHNAIDRIF QGEFDKKNQS
VVIEEFLTGQ EISVLALTDG LTIRPLLPAQ DHKAVGEGDT GPNTGGMGAY APTPIVTPEL
MTHIQQQVLE PTLTNLQKRG IDYRGVIYAG LIISPTGEAK VLEFNCRFGD PETQPILSLL
ETPLEELLLA CCEQRLGQLP PINWKSGVAV CVVLASGGYP GTYQKGKIIC GVEKAEATGV
KIFHAGTKFQ AQNLITEGGR VLGVTAVGAN FTEAKAKAYA AVDCIQFERM YYRRDIGRQV
FSN