Gene Tery_3411 details


Gene Information

Locus tag: Tery_3411
Symbol:
ID: 4244448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5221535
End bp: 5222911
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 35%
IMG OID: 638108394
Product: Xaa-Pro dipeptidase
Protein accession: YP_722984
Protein GI: 113476923
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
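
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5221535
end_bp = 5222911

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1377
print(protein_length)   # 458
```

Both results agree with the Gene Length and Protein Length fields of the record.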


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.141786
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAATTC CAACAAATCA TACTTATCTA TCTAAAACAC TCCGCAACCG ACGAGAAAAA 
TTAGCAAAAT TAATTGATTT TCCAACAATA CTTTGGTCAG GTAGTTCTAG TTCCCGCAAC
TTTCCAGCTA ATACTTTTCC CTTCCGTCCT AGCAGTCATT TTCTTTATTT TGCTGGGCTA
CCTATTGAAG ATGCTGCTAT CCGTTTAGAA GGAGGAAAAT TAGAACTATT TATGGATAAT
CTATCCCCAA GTAATTTACT TTGGCATGGA GAAATACCAA CACGCGATCG CCTCGCCGAA
ATTATAGGTG CTGATGCTGC TTTTCCGATT AAAGAATTAA AAGATTATGC CGCAAATGCA
GCAACTATTT ATGTACAAAA TCCCACTACT AAAATCACAC AATGTCAGAT TTTAAATCGC
GATATTTTTC CTTCTAAGAA ACACCAAAAA ATTGACTTAG AATTAACAAA AGCAATTATC
TCTTTAAGAC TTAGCCATGA TGATATGGCT TTAACAGAAA TTAAACAAGC AGCAGCAGTA
ACAGTAAAAG CTCATAAAGC AGGAATGGCA GCAACAAAAA ATGCTAAATT TGAAGCTAAT
ATCCGTGCTG CAATGGAAAG TATTATTATT TCTCATAATA TGACCTGTGC TTATAACAGT
ATTGTAACTG TACATGGGGA AGTTTTACAC AATGGAGAAT ATTATCATCC TCTACAAACA
GGAGATTTAC TATTAGCAGA TGTGGGAGCA GAAACAACTT TAGGTTGGGC AAGTGATGTG
ACTCGTACTT GGCCTATTTC TGGTAAGTTT TCTCCTACAC AAAGAGATAT TTATGATGTA
GTTTTAGCTG CCCATGATAA TTGTATTGCT CAACTTAAAC CAGGTGTAGA ATATTTAGAT
ATTCATTTAT TAGCAGCTAA AACTATCGCC GAAGGATTAG TTAATTTAGG AATCTTAAAA
GGTCAACCAG AACAGTTAGT AGAAATGGAT GCTCATGCAT TATTTTTTCC CCACGGAGTT
GGTCATTTAT TAGGTTTAGA TGTACACGAT ATGGAAGATT TAGGAGATTT AGCAGGATAT
GAAATAGGTC GGGAACGTAG TAGTCGTTTT GGTTTAAGTT TTTTGCGATT AAATCGTCCG
TTAGCTTCTG GAATGTTAGT CACAATTGAG CCTGGTTTTT ATCAAGTTCC AGCGATTTTA
AATAACACAG AAACACGTCA AAAATATCAG CATATTGTCA ATTGGGAAAA GCTCAAACAT
TTTTCTGATG TTCGAGGTAT TAGGATTGAA GATGATGTTT TAGTTACCAC AAAAGGTGCT
GAAATTTTGA CTAAAGAATT ACCAAGCAAT ACAGATGTAA TTGAAAGTTT ACTGTAA
 
Protein sequence
MQIPTNHTYL SKTLRNRREK LAKLIDFPTI LWSGSSSSRN FPANTFPFRP SSHFLYFAGL 
PIEDAAIRLE GGKLELFMDN LSPSNLLWHG EIPTRDRLAE IIGADAAFPI KELKDYAANA
ATIYVQNPTT KITQCQILNR DIFPSKKHQK IDLELTKAII SLRLSHDDMA LTEIKQAAAV
TVKAHKAGMA ATKNAKFEAN IRAAMESIII SHNMTCAYNS IVTVHGEVLH NGEYYHPLQT
GDLLLADVGA ETTLGWASDV TRTWPISGKF SPTQRDIYDV VLAAHDNCIA QLKPGVEYLD
IHLLAAKTIA EGLVNLGILK GQPEQLVEMD AHALFFPHGV GHLLGLDVHD MEDLGDLAGY
EIGRERSSRF GLSFLRLNRP LASGMLVTIE PGFYQVPAIL NNTETRQKYQ HIVNWEKLKH
FSDVRGIRIE DDVLVTTKGA EILTKELPSN TDVIESLL
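
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a sketch under stated assumptions, not the method used to produce the record: it uses the standard codon assignments (translation table 11 shares them with the standard code and differs only in which codons may act as start codons), it does not handle alternative start codons, and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
first_bases = "ATGCAAATTC CAACAAATCA TACTTATCTA"
print(translate(first_bases))  # MQIPTNHTYL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.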