Gene Tery_4599 details


Gene Information

Locus tag: Tery_4599
Symbol:
ID: 4246253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7072002
End bp: 7073312
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 35%
IMG OID: 638109472
Product: aminopeptidase P
Protein accession: YP_724048
Protein GI: 113477987
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
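
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7072002
END_BP = 7073312
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)              # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 7073312 - 7072002 + 1 gives 1311 bp, and 1311 / 3 - 1 gives 436 aa, so the listed values are mutually consistent.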


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTT CAACAGAATA TAAACAACGA CGCAAACAAT TAATAACAAA AATTGGCAAT 
GGTACGGCTA TATTTAGAAG TGCGCCAATG GCTGTAATGC ACAATGATGT AGAATATGCT
TATCGTCAAG ATAGCGATTT TTTTTATTTA ACAGGTTTTA ATGAACCGGA AGCTGTGGCA
GTTATTGCAC CACACCATGA AAAGCATAAA TTTGTTCTGT TTGTACAACC AAAAGACCAA
TTAAAAGAAA CTTGGACTGG TTATCGTGCT GGAGTGGAAG TTGCTAAGGA AAAGTATGGT
GCTGATGCAG CTTTTTCTAT TAATGAACTG AATAAAAAGT TGCCTGAATA TTTGAAAAAG
GCTGATAAAA TTTATTATCG TTTGGGACGC GATCGCAACT TTAATGAAAC AGTATTTAAA
CATTGGCAAA ATTTAATGCG AGTCTATCCG AAATCTGGCA CTGGTCCAAT AGCAATTCAA
GATGCAGGGA CAATTTTACA CCCAATGCGT CTTGTTAAAA GTGCTAAGGA ATTAGAACAA
ATGCAAAAAG CTGCTGATAT TGCTGTTAAT GCTCATAATT ATGCGCTCAA GTTTGCTCAA
GCAGGTCAGT TTGAATATCA AATTCAAGCG GAAATGGAGT ATATATTTTC TCGTCATGGA
GCTACTCCTG CTTATCCTTC TATTGTTGCT TCTGGTGCTA ATTCTTGCAT TCTTCATTAT
ATAGAAAATA ATCGACAAAT GCAAGAAAAT GATTTGTTAT TAATTGATGC TGGAGCTGCT
TACAATTATT ATAATTCTGA TATTACTCGA ACTTTTCCCA TAAGTGGGAA ATTTACCCCA
GAACAAAAGA TTATTTATGA GTTAGTTTTA AGGGCACAGT TAGCGGCAAT TGAACAAGTA
AAACCAGGAA ATCCTTATAA GCAAATTCAC GAGACAGCAG TGCGAGTTTT AGTGGAAGGA
TTGATAGATT TAGGAATGTT AAAAGGTAAT ATTGATGAAA TAATTGAAAA GGAAAAATAT
AGGCCTTTTT ATATGCATAA AACCGGACAT TGGTTGGGTT TAGATGTTCA TGATGTAGGT
GTTTATCAGT GGGGAGAAGA ACCTCAAATT TTACAACCAG GACAAGTTTT GACTGTGGAA
CCTGGTATTT ATATTGGTCT TAATATTAAA CCTGCTGAAG GTCAACCGGA AATATATGAT
CGTTGGCGTG GAATTGGAGT AAGAATTGAG GATGATGTTT TGGTTACTGC AGAAGGATGT
GAAGTATTAA CTGCGGGAGT GCCTAAGTTA GTTGAGGATT TAGAAAGTTA A
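
The GC content field of this record could be recomputed from the gene sequence above. The sketch below is illustrative only: it assumes GC content means the fraction of G and C bases over the full gene sequence, which the record does not state, and the function name `gc_content` is hypothetical. It ignores the spaces and line breaks used to lay the sequence out in blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence as listed in this record; paste the
    # full sequence here to compare against the reported GC content.
    first_row = "ATGGCAATTT CAACAGAATA TAAACAACGA CGCAAACAAT TAATAACAAA AATTGGCAAT"
    print(f"{gc_content(first_row):.1%}")
```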
 
Protein sequence
MAISTEYKQR RKQLITKIGN GTAIFRSAPM AVMHNDVEYA YRQDSDFFYL TGFNEPEAVA 
VIAPHHEKHK FVLFVQPKDQ LKETWTGYRA GVEVAKEKYG ADAAFSINEL NKKLPEYLKK
ADKIYYRLGR DRNFNETVFK HWQNLMRVYP KSGTGPIAIQ DAGTILHPMR LVKSAKELEQ
MQKAADIAVN AHNYALKFAQ AGQFEYQIQA EMEYIFSRHG ATPAYPSIVA SGANSCILHY
IENNRQMQEN DLLLIDAGAA YNYYNSDITR TFPISGKFTP EQKIIYELVL RAQLAAIEQV
KPGNPYKQIH ETAVRVLVEG LIDLGMLKGN IDEIIEKEKY RPFYMHKTGH WLGLDVHDVG
VYQWGEEPQI LQPGQVLTVE PGIYIGLNIK PAEGQPEIYD RWRGIGVRIE DDVLVTAEGC
EVLTAGVPKL VEDLES
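
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so an internal start-codon check is omitted. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(cds.split()).upper()  # drop layout spaces and newlines
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGCAATTT CAACAGAATA TAAACAACGA"))  # MAISTEYKQR
```

The first ten codons translate to MAISTEYKQR, matching the start of the protein sequence above. Passing the full gene sequence to `translate` would allow the remaining residues to be compared in the same way.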