Gene Tery_0337 details


Gene Information

Locus tag: Tery_0337
Symbol:
ID: 4243152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 516434
End bp: 517675
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 35%
IMG OID: 638105669
Product: hypothetical protein
Protein accession: YP_720284
Protein GI: 113474223
COG category:
COG ID:
TIGRFAM ID:
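
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both conventions are assumptions here, not something the record states. A minimal sketch of the check, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(516434, 517675)
print(length)                     # 1242
print(protein_length_aa(length))  # 413
```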


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00896127
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGCAG AAAAGGTAGT ATCCCAAACA TTTCGTCGAG TTCGTCCACG ATATCGTCCA 
GTAAAAAAAC GGCCAGTTAA AGTTCGTCGG CGAGTTAAAA CCAAAAATAT ACAGTTAATA
TTATTACAGT TAAAATATAA AAATATATTT ATTTTAGTTG CTTTATTAAT GACAGTGAGC
TGGATTATTA CCTTACCTTT TAGAGGTCGG CCAGCTTCAG AGAAACCTTT ACCTACTCCT
GCATCTTCTG TATCACCTAC CCTAATACCA CCCTATGTAC CCCCAGTTCC TGAAGAAACT
CCAAAAGATT TAGGATTTGC CTATAATGTT CGTAGACAAA CATACAAACG TAATAGCCCG
CAGTTACAAG AAATAGTTGA TGAATTACTC AGTATTGCTA AAGAAAAAGG GCTGCCTACA
GCGCCTCTAT CTATTAGTTT AATTGATGTT AGTAACCCAG ATCTTCACAC ATATGCTGGA
TATAAAAATC AAGTTTTAAG ATACCCTGCT AGTGTAGCCA AATTATTTTG GATGGCTGCA
TTTTATGGAG CAGTTGAACA AAGTTTAATT GATAATGAAC CGAAGTTTTA TGAAGATTTA
AGATTAATGA TGCAGAAATC TCATAATGAT TCTGCTAGTA GAATTTTAGA TGCAATTACT
GATACAAAAT CAGGGGTTAA ATTGGAAGGC AAAAAGTTAA ATACTTGGTT AGAAAAAAGA
AAATCAGTCA ATGAATTTTT TCAAAAAGCT GGTTACCAAG ATTTAATAGT TAGTACAAAA
AACTATCCAA GATATTCTCC TAGTCAAACA GGTCCAGTAG GTCGCGATCG CCAACTACGA
AAGCAAGATG GTAAGTTTCT CCGAAATTTG ATTTCAACTG ACCAGGCTAC CAGATTAATA
TATGAAATAT ACACTAGGCA GGCAGTTTCA CGAAAGTATA GTACGAGAAT GGCTTATTTA
TTAACGAGAG ATTTAAGACC CCAAGTATGG CAGAATGATC CCTACAATGG AGTCAAAGGT
TTTCTGGGAG AGTCTCTGCC TGCTAATATT TATTTCGGTT CTAAAGTTGG TTTGACTTCT
AAAGATCGTA TGGATGTCGC CTTTGTCAGA ACTTTAGATA ATCAAGCTAT TTATATTTTA
GCAATTTTTG CAGAAGATGC TGCTTATTCT AATGATGAAG AAATATTTCC TAAATTGTCT
CGTCATGTTT ATGATCGCAT GATGGCAATG GATAGCAAAT AA
 
Protein sequence
MHAEKVVSQT FRRVRPRYRP VKKRPVKVRR RVKTKNIQLI LLQLKYKNIF ILVALLMTVS 
WIITLPFRGR PASEKPLPTP ASSVSPTLIP PYVPPVPEET PKDLGFAYNV RRQTYKRNSP
QLQEIVDELL SIAKEKGLPT APLSISLIDV SNPDLHTYAG YKNQVLRYPA SVAKLFWMAA
FYGAVEQSLI DNEPKFYEDL RLMMQKSHND SASRILDAIT DTKSGVKLEG KKLNTWLEKR
KSVNEFFQKA GYQDLIVSTK NYPRYSPSQT GPVGRDRQLR KQDGKFLRNL ISTDQATRLI
YEIYTRQAVS RKYSTRMAYL LTRDLRPQVW QNDPYNGVKG FLGESLPANI YFGSKVGLTS
KDRMDVAFVR TLDNQAIYIL AIFAEDAAYS NDEEIFPKLS RHVYDRMMAM DSK
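
The gene and protein sequences above can be related with a short script. The following is a sketch, not the database's own method: the helper names are hypothetical, the sequence is assumed to be pasted in the space-separated 10-base blocks shown in this record, and only the codon-to-amino-acid assignments of translation table 11 are used (they match the standard genetic code; the alternative start codons that table 11 permits are not handled, which does not matter for a gene starting with ATG).

```python
# Codons enumerated in TCAG order; the amino-acid string follows the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCACGCAG AAAAGGTAGT ATCCCAAACA"))  # MHAEKVVSQT
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content field in this record.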