Gene Tery_0652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0652 
Symbol 
ID4243159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1064179 
End bp1065519 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content39% 
IMG OID638105951 
Producthypothetical protein 
Protein accessionYP_720564 
Protein GI113474503 
COG category[S] Function unknown 
COG ID[COG1641] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00299] conserved hypothetical protein TIGR00299 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAA TAGCTTATTT TGAATGTCCC ACTGGTATTG CAGGCGATAT GTGTCTGGGT 
GCTTTAGTCC ATGTTGGTGT TCCTCTAAAC TATCTAATAG AAAAACTAAA TCTGTTGGGT
ATTTCTCAAG AATATCAGTT AAGTGCTGAG AAAGTCCATC GTAATGGTTT AGTTGCTACA
AAATTTCATG TTAGACTTAT TCATGCAAAT CCTGAAGACG GGGAACCTCT GACTTTTGAT
CATCACCATG ACCATCATCA CACTCATAAT TTAGAAAATA AATCTAGCAC AAAAAAACAT
CATCATACTC ATCACCGCCA TTTGCCAGAA ATTGAAGCAT TAATCAAAAA AGCAGGGTTG
CCCCAACGTG CTGAAGATTG GAGTTTAAAA GTGTTTCGTA AGTTAGCAGA AGCAGAGGGG
GCTGTACATG GTATTGCCCC AAAGCAAGTA CATTTTCACG AAGTGGGGGC AACTGATGCA
ATTATTGATA TTGTAGGAAC TTGTTTGGGG TTAGATTGGT TGGACATTGA CTTACTGTAT
TGTTCACCTC TGCCTATTGG TGGAGGAACT GTTAAAGCAG CTCATGGTCG ATTACCTGTA
CCTGTACCAG CAGTGCTCAA ACTTTGGGAG TTGCATAATG TTCCGATTTA TAGTAATGGT
TTGGAAAAAG AATTGTGTAC TCCTACAGGT AGCGCGATCG CTTGTACTTT TGCTAGTAGT
TTTGGTCCAC CACCGCCAAT GTTTTTGGAA CGAGTAGGTT TGGGAGCAGG TTCCCAGAAT
TTAGCTATTC CTAATATTCT TCGCTTATGG ATCGGTGAAG GAAAAGCTAG TACTCAAAAT
TTTCAAATTA CAAAATCTGA TGCTCCCACT TCTCAAGATA TTCACCTAGA AACTGTGTCA
GTGTTGGAAA CTCAAATAGA TGATTGTTCT CCTCAAACTA TCGCTTATAC TTTTGATGCT
CTGTTTGCTG CGGGAGCCTT GGATGTTTTT AGTCAGCCTG TAACTATGAA GAAATCTCGT
TTGGGAGTTT TACTAACTGT TATTTGTACA CCAGAAAAGT TGTCTGCTTG TGAAGAAGTA
ATATTTCAAG AGACTACGAC TTTGGGTATT CGTTGCTCTA TTCAACAACG GAGTATTTTG
AAGCGAGAAA TTCATCAGGT GCAAACAGAA TATGGAGCCA TCCGACTAAA AGTAGCAAAG
AAGGGGGAGA AAATTGTGAA TGTACAACCG GAATATGAAG ATTGTGCTGC ATTAGCTAGA
CAGGAAAATA TGCCTTTGAT AGAAGTACAT AAAATGGTAT TGCAGAGTTG GCAATTGCAC
TACTCAGAAG GTTTGAAATA A
 
Protein sequence
MNKIAYFECP TGIAGDMCLG ALVHVGVPLN YLIEKLNLLG ISQEYQLSAE KVHRNGLVAT 
KFHVRLIHAN PEDGEPLTFD HHHDHHHTHN LENKSSTKKH HHTHHRHLPE IEALIKKAGL
PQRAEDWSLK VFRKLAEAEG AVHGIAPKQV HFHEVGATDA IIDIVGTCLG LDWLDIDLLY
CSPLPIGGGT VKAAHGRLPV PVPAVLKLWE LHNVPIYSNG LEKELCTPTG SAIACTFASS
FGPPPPMFLE RVGLGAGSQN LAIPNILRLW IGEGKASTQN FQITKSDAPT SQDIHLETVS
VLETQIDDCS PQTIAYTFDA LFAAGALDVF SQPVTMKKSR LGVLLTVICT PEKLSACEEV
IFQETTTLGI RCSIQQRSIL KREIHQVQTE YGAIRLKVAK KGEKIVNVQP EYEDCAALAR
QENMPLIEVH KMVLQSWQLH YSEGLK