Gene Tery_0807 details


Gene Information

Locus tag: Tery_0807
Symbol:
ID: 4241770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1282672
End bp: 1283964
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 39%
IMG OID: 638106085
Product: pentapeptide repeat-containing protein
Protein accession: YP_720697
Protein GI: 113474636
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
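
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1282672
end_bp = 1283964
gene_length_bp = 1293
protein_length_aa = 430

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under those assumptions all three checks agree with the record: 1293 bp is 431 codons, of which 430 encode amino acids.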


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATT ATTCTAACTT CTCTCCAAGT AATAATCCTA ATTCCCACTC TACATTAGAA 
AGTCCAAATG GTACCCCCGA AGCAATTACT TCTGTAAAAA AGTTAAACTT CATAGATAAT
GAAATAGAAC AAATGCTTCC CACCCCTGCG ACACCCAATA CTTTAGTGGA GCATCAAAAA
TCTGAAGGGC CAAAAAATCC TGTAGCATCT ATTGTTACTC TAGTAGCGAT CGCATTGATG
ATTCTGGGTT TAGCCATAGA TAATGTGCTG CTTGGTTATA CCAGTGCTAT TATAGTAATA
CTATCTTCAG TCAAAATGAT TTGGCCTAGT TGGGGTAAAG TCTGGAAAAC TTTGATTCCT
TCAGTTTGGC GTAATCTAAT CATTGCCTGC TTTGGTCTCC TGGCAGCTAT TGTTGGTTTG
CTGATGTTAA GTGGGGCAAA CCAACAACCT GGTAGTAGAA ATATCCACAT TAACTGGGAT
GCTATTGGAG CGGTGGGTGA ACTTATTGGA GCTTTGGGTC AAATTTTAAT TGCAATAATT
GCTGTATATG TAGCTTGGCG ACAATACGTT ATTTCTAAAG ATTTGACAAT TCAACAAAAC
CGCATTACTC AACAACAAAC TATTGATGCT TACTTTCAAG GGGTTTCTGA TTTGGCAATG
GATGAAAAAG GTTTCTTGGA AGATTGGCCA CAGGAACGAG CGATCGCTGA AGGTCGTACA
GCGGCTATTA TTAAAAGTGT AGATGAAGAA GGGAAAGCTA AAATTCTCAG ATTTTTATCT
CAGTCTAGAC TGGTAACACC AATTAAACGT GATAGACTGC TAGGCCGTCC CATATTTGAT
GGTCAAGGTG GTTATGCTGA AGATAGGGAA CATGGTACTC GTGTTATTGA TTTAGGAGTA
ATGTTAGCAG GTGCTGACCT GAAAAACACA GATTTGCGGT GGACAGAGTT AAGTGATGCT
AATTTAGTGA GAGCTAATCT TAGTGGCTGT GATTTAGTCA AGGCTAATTT CTCTCGTACT
ATTCTATATG AAGCAAGTTT GGTAGGTGCT GATTTGAGGG GAGTCAGATT TTTCTATGGT
ACTGCTGAAT ATGCTACTCC CCGCAGTCGT ACTCATATAC CTAACTATCA AACTGGTGCT
TATACTGGTG CTGTGGTAGA AAATGTTGAC TTTACGGAAG TGAAGCGGTT GTCTGATGAA
CAACATTATT ATTGTTGTGC TTGGAGTGGA GAAAGAAGTA GAAAAACTAT ACCAAATGGT
TGTGGAGGTA TTCCAAATAA GTTAGGGCGT TAA
 
Protein sequence
MNNYSNFSPS NNPNSHSTLE SPNGTPEAIT SVKKLNFIDN EIEQMLPTPA TPNTLVEHQK 
SEGPKNPVAS IVTLVAIALM ILGLAIDNVL LGYTSAIIVI LSSVKMIWPS WGKVWKTLIP
SVWRNLIIAC FGLLAAIVGL LMLSGANQQP GSRNIHINWD AIGAVGELIG ALGQILIAII
AVYVAWRQYV ISKDLTIQQN RITQQQTIDA YFQGVSDLAM DEKGFLEDWP QERAIAEGRT
AAIIKSVDEE GKAKILRFLS QSRLVTPIKR DRLLGRPIFD GQGGYAEDRE HGTRVIDLGV
MLAGADLKNT DLRWTELSDA NLVRANLSGC DLVKANFSRT ILYEASLVGA DLRGVRFFYG
TAEYATPRSR THIPNYQTGA YTGAVVENVD FTEVKRLSDE QHYYCCAWSG ERSRKTIPNG
CGGIPNKLGR