Gene Tery_4965 details


Gene Information

Locus tag: Tery_4965
Symbol:
ID: 4246619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7572184
End bp: 7573545
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 37%
IMG OID: 638109776
Product: hypothetical protein
Protein accession: YP_724352
Protein GI: 113478291
COG category: [S] Function unknown
COG ID: [COG4370] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03492] conserved hypothetical protein
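
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 7572184
END_BP = 7573545


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```

Both results agree with the listed gene length (1362 bp) and protein length (453 aa).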


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.252399
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTAA AAATACTTTG CGTGAGTAAT GGTCACGGAG AAGATGTGAT CGGGTCTCAA 
ATTTTGCAAG AACTCAAAAA ATATCCTCTT CCTCCTGATC TTGTTGCTTT ACCGATAGTG
GGAGAAGGAC AAGCTTATAC AATAGCTGAC TTCTCAATTA TTGGCCCCCG ACAAAGAATG
CCTTCTGGTG GATTTATTTA TATGGATGGA AAGCAGTTAT GGCGAGATGT TAAAAGTGGT
TTGTTAAGTA TTGTTTTGGC TCAGTTGAAA GCATTAAGAT CTTGGATAAG TGATCAAACT
AATGATGGTC ATCAAGTAGT TATTTTGGCA GTTGGGGATA TTGTGCCTTT ATTATTTGCT
TGGTTAAGTG GAGCACCTTA TACTTTTTTA GGAACTGCTA AATCTGAATA TCTGATTCGA
GATCAAAATG AGCAACTTCT CCCTAGGTCT TGGGTGGAAA GTTTCCTGCT TTCCTCAGGG
TCAGTCTATT TTCCTTGGGA ACGTTGGTTA ATGAGTAGAA AGAAATGTGG GGCAGTCCTT
GCTAGAGATG ATTTAACAGC AAAAATGTTA AAGAAAAAGT CTATTCGAGC TTATTGTGTG
GGAAACCCGA TGATGGATGG TGTCAAGTTA AAAAGCTCTA TGGAGTTAAT GTCTGGTAAT
AAAGCCCGGA TGCTAGAGAT GCATGATCAA TTAACAATTA CTTTGTTACC AGGGTCTCGT
TCTCCAGAAG CTTATGCCAA TTGGCAAATT ATTCTTCAGG CAGTGACAGG GTTGCTCGAA
AGCTTTCCAC AAAAAAAGTT TTTGTTTCTG GCAGCGATCG CTCCTAATTT AGATTTGGAA
GCTTTTACTA AACAGCTTTT GTTTGATAAT TGGCAAACTG AACAAGAAAT TCTAACTCAG
AACCAAAACA GTATTTTACA AATGCCAACT GATAAACCTG AACTAACTTT TTGTTTTAGA
GAAAAAAGTA TTCATTTTCC CATAAAATTT ATTTCTCAGA ATAAAAATGC AAGTCTGATT
TTAAATCAAC AGGCTTTTCA AGAATTTATA CATCAGGGAG ATTTAGCTAT TGCTATGGCA
GGTACGGCTA CAGAACAATT TGTCGGTTTA GGGAAACCAG CGATCGCTAT TCCTGGCAAG
GGTCCACAGT TTACATCCAC TTTTGCAGAA AATCAAAGTC GCCTTTTAGG AATTTCTCAA
ATTCTGGTTA AAGATCCTAG AGAAGTTTGT GGTGTAGTTA AGTCTTTGTT AGATAACTTA
GAGCAACGGC GCTTAATTGC TAAAAATGGT GTTAAAAGAA TGGGAGGCTC AGGTGCGGCT
AAAAGAATTG CTAATTTTTT AATTAATTTA AATTGGGTTT AG
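
The GC content reported for this gene can be recomputed from a sequence printed in the 10-base grouped layout used here. This is a minimal sketch, assuming GC content is simply the fraction of G and C bases over all bases. The helper name `gc_content` is illustrative.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence above; paste the full sequence to check
# the whole-gene value.
first_row = "ATGACCTTAA AAATACTTTG CGTGAGTAAT GGTCACGGAG AAGATGTGAT CGGGTCTCAA"
print(f"{gc_content(first_row):.1%}")
```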
 
Protein sequence
MTLKILCVSN GHGEDVIGSQ ILQELKKYPL PPDLVALPIV GEGQAYTIAD FSIIGPRQRM 
PSGGFIYMDG KQLWRDVKSG LLSIVLAQLK ALRSWISDQT NDGHQVVILA VGDIVPLLFA
WLSGAPYTFL GTAKSEYLIR DQNEQLLPRS WVESFLLSSG SVYFPWERWL MSRKKCGAVL
ARDDLTAKML KKKSIRAYCV GNPMMDGVKL KSSMELMSGN KARMLEMHDQ LTITLLPGSR
SPEAYANWQI ILQAVTGLLE SFPQKKFLFL AAIAPNLDLE AFTKQLLFDN WQTEQEILTQ
NQNSILQMPT DKPELTFCFR EKSIHFPIKF ISQNKNASLI LNQQAFQEFI HQGDLAIAMA
GTATEQFVGL GKPAIAIPGK GPQFTSTFAE NQSRLLGISQ ILVKDPREVC GVVKSLLDNL
EQRRLIAKNG VKRMGGSGAA KRIANFLINL NWV
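
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares. Table 11 differs in which start codons are permitted, and that is not handled here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base, then second, then third, each over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    cleaned = "".join(sequence.split()).upper()
    protein = []
    usable = len(cleaned) - len(cleaned) % 3  # drop any trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE.get(cleaned[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCTTAA AAATACTTTG CGTGAGTAAT"))  # MTLKILCVSN
```

The output matches the first ten residues of the protein sequence listed above.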