Gene Tery_1184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1184 
Symbol 
ID4244068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1854403 
End bp1855686 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content32% 
IMG OID638106403 
Producthypothetical protein 
Protein accessionYP_721015 
Protein GI113474954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.165891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.040028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAAAA GCATACACAG TTGGAAAAGC ACATACAGTT TATTGCTGTC TTTGGTATTA 
ATATCTGGTT CTTTCGTGGC TTGTGTACAA CTAGAGCCAG AAATACCACC TAGTACTTCG
ACAACATCAA TTCCTTTAAA TCCAGATCAG GAAAATTTAA AGAACAGCTT TGATGATGTT
CAAAAAACAA ACAATGAATT AGCCGCTATA GCTAATAACA ATGAGGAAGA AATTAAGAAT
TTAAACTATG AAAAACAAGC AATTTTTAAT CAATTAAATC AGTTTAAGAA TATACGAAGA
ACAGTGATAT TTTCTTTCAT TCTATTAGTA CTATTCACAG TCTTATTGTT GTTTTATATT
TCGCTATTAA TATGGCAGAT AAAACAGAAT AATGCTGTTA ATAAACCCAG TAATTTAGAT
ACTCAAGAAC TTGTTTATGA AGTAGACTAT AAAGTAGAAG ATGCTTATTA TAAATTAGAA
AAAATGTCTA AATTATGGGA TGCTGAAACG AAAAAAATAA AAAGTCAAAT ATACTATTTA
CAACAAAATC AGCAAGTCAA GACTCCAGGT AATAATGTCG ATAATAAAAA GTTGATTGTT
CAAGTAGTAA ATGATAAAAT AGGGGAGATT TATAGGCAAT TACAACAAGA AATTGATCAA
AGATGGGATA TTAAATTAAG GAACCTACAA AATCAAATAA ACCAGCTCCA ACAAAATCAG
CAACTAAAGA CTTCAGACAA CAATGCCGAT AATAAAAAGC TGATTTTTGA AGCAGTGAAA
GAGGAAATTC ATGGGCAATT ACAACAAGAA ATTCGTAAAA GATTGGATAT TGAATTAAGG
AACCTACCAA ATCAAATAAA CCAGCCCCAA CAAAATCAGC AACTAAAGAC TCCAGATAAT
ATAGCTGGAG GTTTGCCCAA TCTTGAGTTT ATGTTAATAT ATAAAGAAGA TACCCGTTCT
TTATCAGAAA ATGCTATAGA GGTCTCAGAA GCAGACCGAA GCATGGTTCA ACCTCGTATA
AGTAGCACTC AAGCAGTAAT TATTCAGAAA GTTCGCAGAG GAAACTATTG GATACTAAAT
GAGGCTGATA TTGATTACAT GGTTCCTAGG CATAACATCA AAATTAATGA ATATAACAGT
AAGACAGTTG CAAATTTATT TGAATGTCAG GGATACCGAC CTGAATACTC CGGGTTTCAA
CTAATAAAAC CCGCCAAAGT ATCTGCTATT TCAAAAGGTG AAACTTGGCA GCTTGTCGAA
CGTGGTATAC TACAATTTGA TTAA
 
Protein sequence
MWKSIHSWKS TYSLLLSLVL ISGSFVACVQ LEPEIPPSTS TTSIPLNPDQ ENLKNSFDDV 
QKTNNELAAI ANNNEEEIKN LNYEKQAIFN QLNQFKNIRR TVIFSFILLV LFTVLLLFYI
SLLIWQIKQN NAVNKPSNLD TQELVYEVDY KVEDAYYKLE KMSKLWDAET KKIKSQIYYL
QQNQQVKTPG NNVDNKKLIV QVVNDKIGEI YRQLQQEIDQ RWDIKLRNLQ NQINQLQQNQ
QLKTSDNNAD NKKLIFEAVK EEIHGQLQQE IRKRLDIELR NLPNQINQPQ QNQQLKTPDN
IAGGLPNLEF MLIYKEDTRS LSENAIEVSE ADRSMVQPRI SSTQAVIIQK VRRGNYWILN
EADIDYMVPR HNIKINEYNS KTVANLFECQ GYRPEYSGFQ LIKPAKVSAI SKGETWQLVE
RGILQFD