Gene Tery_1635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1635 
Symbol 
ID4242324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2495998 
End bp2496975 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content40% 
IMG OID638106774 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_721384 
Protein GI113475323 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.417453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACGG TTTTAGCTAT AGAAACAAGT TGTGATGAAA CATCTGTGGC AATTGTTAAA 
AATCGTCAAG TTTTAAGTAA CATTGTGAAG TCACAAATTA ATATCCATAG CTTTTATGGA
GGAGTAGTTC CAGAGGTAGC TTCACGACAA CATTTAGAAA TAATTAATCA GGCGATCGCT
CAAGCTTTCA GAGAAGCAAA TTTAGACTGG CCAGACATTG ATGGTATTGG AGCTACTTGC
GCCCCTGGTC TAGTTGGCGC TCTGTTAGTT GGCCTGACGG CGGCCAAAAC CTTAGCTATT
GTCCATGAAA AGCCCTTTGT GGGAGTCCAT CACTTAGAAG GTCATATTTA TGCAACCTAC
CTAAGCCAAC CCGAGTTAGT ACCACCTTTT CTTTGTTTGT TAGTTTCTGG AGGTCATACC
AGTTTGATTT ATGTCAAAAA TTGTGGGGAA TATGAAACTT TAGGGCAAAC AAGAGATGAT
GCAGCAGGAG AAGCTTTTGA CAAAGTAGCT AGATTATTAA AACTTGGCTA TCCCGGTGGA
CCTGTAATTG ATAAATTAGC TGCAGAGGGT AATAAATTAG CATTTTCTCT CCCTGAAGGG
AAAATTTCTT TGCCTGGAGG AGGCTATCAT CCTTATGATT TTAGTTTCAG TGGACTCAAA
ACTGCAGTAT TGCGATTAGT CCAGAAAATA GAACAAGAAG GTAATGAATT ATTATCAGGG
TCTTCAGTAA CTAAAGACAT TGCAGCCAGC TTTCAAGAAA CTGTAGCAAA AGGGCTAACA
AAAAGAGCTA TCGCTTGTGC ATTAGACTAT AGCTTAAACA CAATTGCTAT TGGCGGTGGT
GTAGCAGCTA ATAGTAGTTT GCGACAGCAC TTAAGTAGAG CAGTAGAAAG TCATAACTTG
CAAGTACTAT TTCCCTCACT CAGATTTTGT ACGGATAATG CAGCCATGAT AGGTTGTGTG
CGGCTCGTTG GCCACTAA
 
Protein sequence
MTTVLAIETS CDETSVAIVK NRQVLSNIVK SQINIHSFYG GVVPEVASRQ HLEIINQAIA 
QAFREANLDW PDIDGIGATC APGLVGALLV GLTAAKTLAI VHEKPFVGVH HLEGHIYATY
LSQPELVPPF LCLLVSGGHT SLIYVKNCGE YETLGQTRDD AAGEAFDKVA RLLKLGYPGG
PVIDKLAAEG NKLAFSLPEG KISLPGGGYH PYDFSFSGLK TAVLRLVQKI EQEGNELLSG
SSVTKDIAAS FQETVAKGLT KRAIACALDY SLNTIAIGGG VAANSSLRQH LSRAVESHNL
QVLFPSLRFC TDNAAMIGCV RLVGH