Gene Tery_1000 details

Gene Information

Locus tag: Tery_1000
Symbol:
ID: 4245527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1564055
End bp: 1565539
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 36%
IMG OID: 638106239
Product: ankyrin
Protein accession: YP_720851
Protein GI: 113474790
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
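
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the gene information fields above.
START_BP = 1564055
END_BP = 1565539
GENE_LENGTH_BP = 1485
PROTEIN_LENGTH_AA = 494


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1565539 - 1564055 + 1 gives 1485 bp, and 1485 / 3 - 1 gives 494 residues, matching the reported gene and protein lengths.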


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCGC AAAAAGAAGA AACTGCTTTA GTACAGGCTG TTAAAAATAG CAATTTTGCT 
CAAGTATCTA TCCTCGTGGC TCAAAATATT AATGCTAATG CTCAAGCTCT AGATGGTACA
ACTGCTCTAA TGGTGGCCGC AGAAAAGGGC TATACTCAAA TAGCCTCTTT ATTACTAGAT
AAAGGCGCTA ATGTCAACTA TACGAAAAAA AAGTTTGGCG CTACAGCTTT AATGTTGGCT
GCGGCTCATG GGCGATCTGA AATAGTAGAA GTTTTGTTAA GTAGAGGTGC TGATGTCAAT
GCTAAAAATT ATGATGGTGC TTCTCCTCTA ATGGTCGCCT CTATGAATGG TTATCTGTTA
GTGGTGAATC AGCTAATAGC CGCTGGGGCA GATGTTAATG TGGTAGACAA AGATCATGAT
ACCGCTTTGG GTTTAGCAAC CGCTCAAGGT TATCAAAATG TAGTTCAAGT TTTGCTAGAT
GCTGGGGCTA GAATAGATGA GTCTACTTTA TCTGTAGTTG CTCATAGTAA TCATGCCAAA
ATGAGAGAAA TTTTGCTTAA TTATGGTGTG GATGTAAATA CTAAAAATTT ACAGGGTAAA
ACATGGTTAA TACAAGCAGC AGAAGCAGGT GATTTATCTA CTGTCAAGAC ACTATTAAAA
GCTGGTGCTG ATGTTAATTT TAGAGATAAA GATGGAGAAA CTGCCCTGAT TTTATCTGCT
GATCAAGGTG ATTTAGAAAT TGTCAAAGCA TTATTAGAAT CAGGGGCAGA TGTAAATATT
AAAAGCAGAA GTGGTGGAAC AGCTTTGATG GCTGCTGCCG CTGAAGGAAA TCTAGTGATC
GTCTCTACTT TATTAGATGC TAATAGTGAT GTTAATGCTC AAGATTTAGA AGGAGAAACA
GCACTAAGTT TTGCAATAGG AGAAAATCAT ACTGAAACAG TCAAAATTCT CCTTGAACAT
CAGGCAGAAA TAATTACTAA AAATCAAGCT GGCGATACAC CTTTATTCAG TGCAATTTTT
CATGGTTATA CGGATGTTGT CTCAATTTTA TTGGCAACAA TAGAAAAGCA AAATCTGACA
TCTTTACTAA ATAGTAAATA TTTAGGAGAA ACAGCCTTAA CTTTAGCAAT TTGGCAAAAA
AATAGGGAGA TTATTAATAT TTTGCTAGAT GTTGGTGTAG ATATAAATAT TCCAGCTCAG
GGAGGTTATA CCCCAATGAT AAAAGCAGTA TACCAAGGTG ATATTGAAAC ATTAAAGATA
CTTTTAGGAA GAGAAGCCGA TATTAATCTT AGAGATGATA ATCAGGCAAC AGCTTTAATG
TGGGCAGCAT ATCAAGGTCA TACTGAAGCT GTAAAATTGT TAATTGATTC TGGAGCTAAT
TTAACCTATA AAAATACAAG TGGTTATACA GCTTTAATGC TAGCAGAATT TAATGGTTAT
CAAGATATTG TTAGATTACT TAGTAATGCT AAAGGAGAAC ATTAA
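
The GC content field in the gene information could be recomputed from the sequence as printed here, in blocks of ten bases. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole gene; `gc_fraction` is a hypothetical helper, not a function of the source database.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a sequence printed in whitespace-separated blocks."""
    # Remove the spaces and line breaks used for display, then normalise case.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as sample input;
# pass the full 1485 bp sequence to compare against the reported value.
sample = "ATGGCTTCGC AAAAAGAAGA AACTGCTTTA GTACAGGCTG TTAAAAATAG CAATTTTGCT"
print(f"GC content of sample: {gc_fraction(sample):.0%}")
```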
 
Protein sequence
MASQKEETAL VQAVKNSNFA QVSILVAQNI NANAQALDGT TALMVAAEKG YTQIASLLLD 
KGANVNYTKK KFGATALMLA AAHGRSEIVE VLLSRGADVN AKNYDGASPL MVASMNGYLL
VVNQLIAAGA DVNVVDKDHD TALGLATAQG YQNVVQVLLD AGARIDESTL SVVAHSNHAK
MREILLNYGV DVNTKNLQGK TWLIQAAEAG DLSTVKTLLK AGADVNFRDK DGETALILSA
DQGDLEIVKA LLESGADVNI KSRSGGTALM AAAAEGNLVI VSTLLDANSD VNAQDLEGET
ALSFAIGENH TETVKILLEH QAEIITKNQA GDTPLFSAIF HGYTDVVSIL LATIEKQNLT
SLLNSKYLGE TALTLAIWQK NREIINILLD VGVDINIPAQ GGYTPMIKAV YQGDIETLKI
LLGREADINL RDDNQATALM WAAYQGHTEA VKLLIDSGAN LTYKNTSGYT ALMLAEFNGY
QDIVRLLSNA KGEH