Gene Tery_1349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1349 
Symbol 
ID4241846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2057447 
End bp2059189 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content30% 
IMG OID638106524 
Producthypothetical protein 
Protein accessionYP_721135 
Protein GI113475074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0362113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAAA TATTTTCCCG AACCAGAATA TTATTAGTAA CCCTATATAT TTTTCCAGTC 
TTATTACTCC TTTGGTTTCT CGCTAATTTT AGTGTCAATG TTCCCTACTG GGACCAATGG
AGATTAACCC CTATATTTGA GAAAGTTGCT GAAGGAAACG CCACATTTTT TGACTTTTTC
ACAGTACACG GACATCACAG AATATTAATA CCAAAATTAA TCATTACTGC CTTAGCTTTT
GCCACTAAAT GGAACACTCA AGCTGAAATA ATTTGTAGTG TAATTTTTGT CATCATCACA
TTTTTTGCTG TTTGTAAAAT AGCCCAAATT AACTTTCATA ATCAAAATCA AAATATTATA
AATATAGCAA ATATTTTAAG CTGTCTATTT TTATTTTCCC TAGTACAACA TCAGAACTGG
TTATGGGGAT TTACTCTATT TTGGTTTCTC ACAAACTTAT GTTTAATAAT GGCGGTATAT
TTGATTCATG TCTTAGAGAA TATATCGGCA AAAAGAAGAA TATTTCTAGC AGCAATATTT
TGTTTTATCG GAAGTTTTAC ATTAATACAA GGATTATTTT CTTGGCTAGT TTTAATTCCG
TCAATATTGT CTATAGAAGG GAAAGCTAAA CAAAAAATTA CTAGACTAGT GATCTGGAGC
TTACTGTTTG TAGTTTCATG TATTATCTAC GCAATTAACT TCAATCCAAT TCGCGAACCA
AAACCATTTT TATTTACAGA AAAACCATTT TTGATAATAA ATTATTTTTT AGCCGTTATA
GGTTTACCTC TAGTTAGACT TCCCATACCT GCAATTTTAA CAGGATTAAT ATTATTCTCT
AGTTTCCTAT TTTTTACATA TTACTTTCTC AAGAAACCTT TGTATAAACT AACTTTTTTT
CCTCCTACTA TTCCTTGGTT AACAATAGGT ACATTTTCTA TAATTTCTGC TCTATCTATT
TCTGTAGGTA GAGCAGAATA TGGGGTGGAA AATGCTATGA CTTCTTCCCG ATATACAACC
ACATCAATAT TATTAATTAT CTCTTTAGTT TATCTCTTGG CTTTGTTTAT ACAAAAGCAA
CAAAAATATT TATTAATATA TAAAATTTTA GCTGTTGCAA TCACAGGAAT TATGCTCATA
AACTCCCTCT ATGTTGTGAG GAATATTAAA TTAAGTTTTC CCTATATTGA GAGTCGGAAA
GAATGTCTAG AAATAATTAA CTATTTAGAA GATTCAGAAT TTATAAAAAC ATCTCCAGAT
AGTTGTTTAG TGTTAATGAA TGGCAAAACT TGGCTAGTCA GACGAGGTGC AGAAATTATG
GCAAAACTCG GATGGCGAGA ATTTCCTCAA AATCTACAAT TTATCAAACA ACCAAAACAA
AATTATGGTT ATTTAGATCA TCCTCAAACA ACTGCAAAAC CTCTAATTAT TAAAGGTGAA
GAAACTCTAA ATTTGGGAGG TTGGGCAATT CAACCTGATG GAAAAGAACA ACCTAACTTA
GTGTTACTTT CTTCTGGTAA TAATCAAGAT TTTTTTGCTA ATGCTATTGT TAACTTAGAG
AGTTATGATA TTGCTAAAGT TATGAAGTCA AAACTTTATA GTAGAGCCAG ATGGAAAGTA
AAATTTTCAG CGAAATCTTT ACCTGTAGGA GAAACTGTTA TTAAAGCTTG GGTTTATAAC
CCTAATAAAC AAGAATTTGT TAAATTAAAC AATGAAGTTA ATGTCAGGGT AGAAAAGTCT
TGA
 
Protein sequence
MRKIFSRTRI LLVTLYIFPV LLLLWFLANF SVNVPYWDQW RLTPIFEKVA EGNATFFDFF 
TVHGHHRILI PKLIITALAF ATKWNTQAEI ICSVIFVIIT FFAVCKIAQI NFHNQNQNII
NIANILSCLF LFSLVQHQNW LWGFTLFWFL TNLCLIMAVY LIHVLENISA KRRIFLAAIF
CFIGSFTLIQ GLFSWLVLIP SILSIEGKAK QKITRLVIWS LLFVVSCIIY AINFNPIREP
KPFLFTEKPF LIINYFLAVI GLPLVRLPIP AILTGLILFS SFLFFTYYFL KKPLYKLTFF
PPTIPWLTIG TFSIISALSI SVGRAEYGVE NAMTSSRYTT TSILLIISLV YLLALFIQKQ
QKYLLIYKIL AVAITGIMLI NSLYVVRNIK LSFPYIESRK ECLEIINYLE DSEFIKTSPD
SCLVLMNGKT WLVRRGAEIM AKLGWREFPQ NLQFIKQPKQ NYGYLDHPQT TAKPLIIKGE
ETLNLGGWAI QPDGKEQPNL VLLSSGNNQD FFANAIVNLE SYDIAKVMKS KLYSRARWKV
KFSAKSLPVG ETVIKAWVYN PNKQEFVKLN NEVNVRVEKS