Gene Tery_1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1119 
Symbol 
ID4242851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1761243 
End bp1762259 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content38% 
IMG OID638106342 
Productgroup II intron, maturase-specific 
Protein accessionYP_720954 
Protein GI113474893 
COG category[V] Defense mechanisms 
COG ID[COG1403] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0296969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAT GTTTAGAAGA ATTTGCCAAA ACCCTTCCAG GGAGAAAGCA TGACAACATA 
CAGGCATTAT CCTTAATAAG ATACGCAGAT GATTTTGTAA TCCTACACAA AGACATCAAA
GTATTAATAC AAGCAAAAAC CTTAATACAG GAATGGTTAA ACCAGGTAGG ATTAGAGCTA
AAACCAGAAA AGACTAAAAT TGCCCACACC CTGGAAGAAT ATGAAGGAAA CAAACCCGGA
TTTAATTTTT TAGGATTTAC AATAAGGCAA TGGAAAGTCA AGACAACCAA ACAAGGATTT
AAGACACTGA TTAAGCCATC ATCTAAGAGT ATTAAAACTC ATTATCGGAA GTTGGCGGAT
ATATGTGATT GCCACAAAAC CGCCCCTACG AAAGCTTTAA TAGCTAAACT AAATCCGATA
ATCAGGGGAT GGGCCAACTA CTTCTCCACT GTAGTCAGTA AAGAGGTATA CAGCAAATTA
GACTGCCTCC TGTGGAAAAG GATATGGAGA TGGGGAAGTA GACGGCATCC AAACAAGTCA
GCCAAATGGG TAAAACAAAA GTATTTCCCT CGCTGCAAAG AGACCAGAAA CTGGTTACTT
AATGACGGCG AATACGTACT TAACCTACAC GCAGACGTAG CTATAAAAAG ACACGTCAAG
GTAAAGGGCA ATAAATCCCC TTATGACGGC GATTGGACTT ATTGGAGCAG TAGAATCGGT
AAACACCCAG GCATAAGGAA AGAAGTAACA ACGCTGTTAA AACGGCAAAA GAATAAATGC
GCATTTTGTG GATTAACCTT CAGATCAAAC GACCTTATGG AAATTGACCA TATAAAACCA
AGGTCTGAAG GTGGTGACAA CACAGTTAAA AATAAACAAC TGCTACACCG ACACTGCCAC
GATACTAAAA CTGCTTTAGA TAATAAAACA TACACAAAAC CTAAGTTACA GGACTTACCT
GATGAATACC TATGGGTAAA TGATATGTTG ATTCTAAAAA CAGGGATGTA CCTATGA
 
Protein sequence
MEKCLEEFAK TLPGRKHDNI QALSLIRYAD DFVILHKDIK VLIQAKTLIQ EWLNQVGLEL 
KPEKTKIAHT LEEYEGNKPG FNFLGFTIRQ WKVKTTKQGF KTLIKPSSKS IKTHYRKLAD
ICDCHKTAPT KALIAKLNPI IRGWANYFST VVSKEVYSKL DCLLWKRIWR WGSRRHPNKS
AKWVKQKYFP RCKETRNWLL NDGEYVLNLH ADVAIKRHVK VKGNKSPYDG DWTYWSSRIG
KHPGIRKEVT TLLKRQKNKC AFCGLTFRSN DLMEIDHIKP RSEGGDNTVK NKQLLHRHCH
DTKTALDNKT YTKPKLQDLP DEYLWVNDML ILKTGMYL