Gene Tery_4649 details


Gene Information

Locus tag: Tery_4649
Symbol:
ID: 4246303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7148647
End bp: 7150752
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 39%
IMG OID: 638109516
Product: oligopeptidase A
Protein accession: YP_724092
Protein GI: 113478031
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
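
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the record above.
START_BP = 7148647
END_BP = 7150752
GENE_LENGTH_BP = 2106
PROTEIN_LENGTH_AA = 701


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```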


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.621087
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.228337
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCCCTA ATACAACAGT TACAAAAAAA ACATTACTAA TTGGTAAGGG ATTACCCCCT 
TTTGAATCTA TAAAACCAGA AGATGTAGTA CCCGCAATAA CAGAATTACT GACAGAATTA
GAAAAAGAAC TAATCAACTT AGAATTAATA GTTAAACCAA CTTGGAATGA CCTAGTAGAA
CCTCTACAAA AATTAGAAGA TCGTTTAACC TGGAGTTGGG GAATAGTAGG ACATTTAATG
GGAGTTAAAA ATAATCCTGA ACTACGAAAA GCTTATGATC AAGTACAACC CAAAATAGTA
GAGTTTATTA ATAAACTAAA TCAAAGTAAA CCTCTTTATC AAACCTTTAA AAATTTAAGT
AATAGTGATA GCTGGCAAAA CCTAGATTCA GGACAAAAAC GTATAGTTGA AGCTGCCATC
AAAGATGCAG AACTTTCAGG AGTAGGTTTA AAAGGAGAAA AACGAGAACA CTTCAATGCT
ATTGAACTAG AACTAGCAGA ACTTTCTACT AAATTTTCTA ATAATGTTCT TGATGCTACC
AAAGCTTTTA GCCTAATTTT AACTGAAAAA GAAGAAGTAG ACGGTTTACC ACCCAGTTTG
TTAAGTCTAG CTGCTCAAGC TGCCAGAGAT AATGGATCCG AGAATGCTAC ACCAGAAAAC
GGACCTTGGC GGATAACCCT CGACTCTCCC AGTTTTTTAC CCTTCATGCA ACACTGCAAA
AGACGGCAGT TGCGAGAACA ACTTTATAAA GCCTTTATTA GTCGTGCTTC TAGTGAGAAG
TTAAACAACT ATCCTTTAAT TGAACGTATT CTGGAGCTAC GTCAGCAAAA AACAGAAATT
TTGGGCTTTA ATAGTTATGC AGAATTGAGT CTTGCCAGTA AAATGGCTCC TAGTGTGGAA
GCAGTAGAAA AGTTATTAGA AGAATTACGC AGTGTTAGTT ATGATGCTGC AGTTAAAGAC
TTAGAAGAAC TAAAACAGTT TGCTGCCAGC CAAAATGCAC CAGAGGCCAA GGAGTTTAAA
CCCTGGGATA TGAGCTTTTG GTCAGAAAGA CTGCGAGAAG AAAAATTTTC CTTTACTACA
GAAGAGTTGC GACCATATTT CCCCCTGCCA CAGGTTCTTG ATGGGTTATT TAGTTTAGTC
AAGCGTATAT TTGGCATTAA TATTACTGCC GCAGATGGGG AAGCTCCAGT ATGGCATGAA
GATGTGCGCT ATTTTAAAAT TAGTGATGAA ACAAACAGAC CCATTGCCTA TTTTTACCTT
GATGCTTATA GTCGTCCTGC TGAAAAACGA GGTGGTGCAT GGATGGATGA CTGTATAAAT
CGTGCTAAAA TTGTAGAGAA TGGTCAAACT ACTTTGCGTT TACCTGTAGC TTATTTACAA
TGTAATCAAA CACCGCCTGT AGATGGTAAA CCTAGTTTGA TGAATTTTAG TGAGGTAGAA
ACTCTGTTCC ATGAATTTGG ACATGGTTTA CAACATATGC TGACAACAGT TGACTATGGA
GGTGCTTCAG GTATTAATAA CGTAGAGTGG GATGCGGTAG AGTTACCTAG TCAGTTTATG
GAAAATTGGT GTTATGACCG CTCCACTTTA TTCGGAATGG CGAAACATTA TGAAACAGGG
GAGGTTTTGC CAGAACATTA TTATCAGAAA CTTTTGGCTG CTAGAAATTA TATGAGCGGT
AGTGCGATGT TGCGCCAGTT GCATTTTGCT TTAGTTGATA TTGAGCTCCA TCACCGTTAT
CGACCAGCTG GGGAAGAAAC AGTGCTAGAT GTGCGTAAGC GTGTTGCTGA AACTACTACT
GTTTTGGAAC TATTGCCAGA AGACTCATTT TTATGTGCAT TTGGGCATAT TTTTGCTGGT
GGTTATGCTG CCGGTTATTA TAGTTATAAG TGGGCGGAGG TATTGAGTGC TGATGCTTTT
GCCGCTTTTG AAGAGGCTGG TTTGGAAAAT GAACAGGCGA TCGCTACTTG TGGTCAGCAG
TTTAGGGATA CAGTTTTAGC TTTGGGCGGG AGTCTGCATC CGATGGAGGT ATTTAAAACT
TTCCGAGGTC GAGAACCTAG TACTGAACCT TTGTTAAGGC ATAGTGGTTT GGTAGCTACA
GCTTAA
 
Protein sequence
MSPNTTVTKK TLLIGKGLPP FESIKPEDVV PAITELLTEL EKELINLELI VKPTWNDLVE 
PLQKLEDRLT WSWGIVGHLM GVKNNPELRK AYDQVQPKIV EFINKLNQSK PLYQTFKNLS
NSDSWQNLDS GQKRIVEAAI KDAELSGVGL KGEKREHFNA IELELAELST KFSNNVLDAT
KAFSLILTEK EEVDGLPPSL LSLAAQAARD NGSENATPEN GPWRITLDSP SFLPFMQHCK
RRQLREQLYK AFISRASSEK LNNYPLIERI LELRQQKTEI LGFNSYAELS LASKMAPSVE
AVEKLLEELR SVSYDAAVKD LEELKQFAAS QNAPEAKEFK PWDMSFWSER LREEKFSFTT
EELRPYFPLP QVLDGLFSLV KRIFGINITA ADGEAPVWHE DVRYFKISDE TNRPIAYFYL
DAYSRPAEKR GGAWMDDCIN RAKIVENGQT TLRLPVAYLQ CNQTPPVDGK PSLMNFSEVE
TLFHEFGHGL QHMLTTVDYG GASGINNVEW DAVELPSQFM ENWCYDRSTL FGMAKHYETG
EVLPEHYYQK LLAARNYMSG SAMLRQLHFA LVDIELHHRY RPAGEETVLD VRKRVAETTT
VLELLPEDSF LCAFGHIFAG GYAAGYYSYK WAEVLSADAF AAFEEAGLEN EQAIATCGQQ
FRDTVLALGG SLHPMEVFKT FRGREPSTEP LLRHSGLVAT A
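
As a spot check that the protein sequence corresponds to the gene sequence, the first and last codons can be translated by hand or with a short script. The sketch below is illustrative only and is not how the record was produced. It builds the codon table from the usual compact NCBI ordering (bases in the order T, C, A, G). Translation table 11 uses the same amino acid assignments as the standard code and differs in which start codons are allowed, so this table is assumed to be adequate for internal codons. The function name `translate` is hypothetical, and only two short excerpts of the gene sequence are used.

```python
# Compact codon table: bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Excerpts copied from the gene sequence above.
first_30_bases = "ATGAGCCCTA ATACAACAGT TACAAAAAAA"
last_6_bases = "GCTTAA"

print(translate(first_30_bases))  # MSPNTTVTKK, the first ten residues listed
print(translate(last_6_bases))    # A*, the final alanine followed by the stop
```

The same function can be applied to the full gene sequence. Under the assumptions above, the result should be the 701-residue protein followed by a single `*`.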