Gene Tery_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1054 
Symbol 
ID4241939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1648441 
End bp1649709 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content37% 
IMG OID638106286 
ProductS-layer-like region 
Protein accessionYP_720898 
Protein GI113474837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0296969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATA CACCGACAAA ACTACCATCA CAGCTAAGGC CAGTAAAACC TGATGAGTGG 
ATTGCAATAT TTGTAGCTTT AGGGACTTTT GGCAGCATAT TTTTCTGGGC AACGACTGGA
GAAAAGAATG GATTTAATCT ACTTTCTAAA CCCATGCTTT CTACACCTCT ATCTGAAAGT
TTTGGTAGTT CCAATATTGC TTCTGGGAAG TCAATATTTA GCTTAGACAT ACCTCAACTT
AAGACTTCGA CTGGTAAGTC AGGATCAAAT TTTGAAGAGA GCTCTCAAAA TCCTGAAGAA
TTAGGAACAA GTCTGGATAC TTTGACTTCT ATGACTGATC CTGTTGAGGA TGATGAACCA
AATCTTCAGA GTTTAGATCA AGTCAAGACT GAGACGAAGA CAATACCTAA ATTGTTTGAA
GTGCTAAAAA AAATTAATCA AAAAGCTGCT CCGCCAGTTG CTGATAATTT GGGAATATTA
GACAGTGGAA AGATATTATC TAAATCTCCA CCTACTATAT CATCGGAAGT TACAGATGAT
CTAATAACAA CTGCTCCTCT ACCTGTAGCA CCTTCTATTC CTACTCCAAC AACAGAGTTA
CCCCAAGAAA AGGAAACTTC TTTGCCTTCA ACAGCATTAC CCTCTCCTGA TATATCTTCT
AGTAGCACGG TTAAATTTTC TGATGTTCCT AATAGTTTTT GGGCAAGTAG TTTTATTCAA
TCTTTGGTAG AACAGGATTT TATTGCTCAG ATTAATAATG ATCAGTTTGA GCCAGACAAA
CCTGTAACAC GAGCTGAATT TGCTGCTCAA ATTGCCAAAG TATTTGAGGA AAAATCAGCT
AAAAAATCTG TTGTATATAA AGATATCAAA GGAGATTCAA CAGCTCAAAG TGAAATTCAA
ACATCTACTA AATCTGGTTT TTTAAGTGGG TATCCAGGGG ATGTTTTTCG CCCAGAAGAA
AAAGTATCAA GGTTGCAGGT GTTAGTTTCT TTAGCTAGTG GCTTAAGTCT AGAAATTCCA
TCTGATCCTG ATAGTGTTTT AAGTGTTTAC AAAGACACAA CAGAAATACC TGATTGGGCT
AAAGAAAAAG TAGCTGCTGC AACTGCTGCT GAATTAGTAG TGAGCCATCC AGATGTAAAA
ATGCTGAATC CAAATCAACC TGCTACTCGT GCTGAAGTTG CAGCAGTTTT TTATCAAGCG
TTGGTAAAAT TAGGGCAGGT AGAAAAGATT TCATCTGAGT ATATTGTGAA TCCGAAAAAG
GAGAATTAG
 
Protein sequence
MTDTPTKLPS QLRPVKPDEW IAIFVALGTF GSIFFWATTG EKNGFNLLSK PMLSTPLSES 
FGSSNIASGK SIFSLDIPQL KTSTGKSGSN FEESSQNPEE LGTSLDTLTS MTDPVEDDEP
NLQSLDQVKT ETKTIPKLFE VLKKINQKAA PPVADNLGIL DSGKILSKSP PTISSEVTDD
LITTAPLPVA PSIPTPTTEL PQEKETSLPS TALPSPDISS SSTVKFSDVP NSFWASSFIQ
SLVEQDFIAQ INNDQFEPDK PVTRAEFAAQ IAKVFEEKSA KKSVVYKDIK GDSTAQSEIQ
TSTKSGFLSG YPGDVFRPEE KVSRLQVLVS LASGLSLEIP SDPDSVLSVY KDTTEIPDWA
KEKVAAATAA ELVVSHPDVK MLNPNQPATR AEVAAVFYQA LVKLGQVEKI SSEYIVNPKK
EN