Gene Tery_4538 details


Gene Information

Locus tag: Tery_4538
Symbol:
ID: 4246192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7002260
End bp: 7003756
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 32%
IMG OID: 638109415
Product: deoxyribodipyrimidine photo-lyase type I
Protein accession: YP_723991
Protein GI: 113477930
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID: [TIGR02765] cryptochrome, DASH family
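
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(7002260, 7003756)
print(length, protein_length(length))  # 1497 498
```

Both values agree with the Gene Length and Protein Length fields of this record.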


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0705059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAAA ACGTAATTAT ATTATGGTAT CGTAATGACC TACGCATCCA TGACCATGAA 
CCACTATATA AAGCACTCAA AGTTAATGCT CAAATTATAC CGATTTATTG TTTAGATCCA
AGGCAATTTA GTCAAACAGA TTTTGGTTTT CCTAAAACAG GTGTATTTAG AGCAAAGTTT
TTACTAGAAA GTATTGCTGA TTTGCGTAAC AACCTACAAA AATTAGGTAG TAATTTAGTT
ATTTTTCAAG ATAAACCAGA AATAGTAATT CCTAGACTAG CTCAACAATT ATCTGCTAAA
TCAGTATTTT TTCATCAAGA AGTTACTGAG CTCGAAGTTA AAGTAGAAAG ATTAGTTCAT
CAAGCACTAA AACAAATTGG AGTTAGGTTA AAATCATTTT GGGGTCATAC ACTTTACCAT
CCAGATGATT TACCTTTTGA AATAAAACAA TTACCAGAAT TATTTACTAC TTTTCGTAAA
GATGTAGAGA AAAATTCTAG TGTAAACCCT ACATTCTCAA TCCCTAAAAA ATTATCATCC
TTACCAAAAA TTGATGTGGG AGAATTACCT ACATTATCTG ATTTAAATCT AGAAAAACCG
CCACTAAATT CACAGGGAGT TTTAGAATTT AAAGGTGGAG AAACTGCCGC TAAGGAAAGA
GTAAAAAACT ACTTTTGGCA GCAAGACTAT TTAAAAGTTT ATAAGGAAAC CAGAAATGGA
ATGCTAGGTG CCAATTATTC TTCTAAATTT TCCCCTTGGT TAGCTTTAGG ATGTTTGTCA
CCTCGTTATA TTTATGAAGA AGTTAAAGAA TATGAATATC AAAGAGTCAA GAATCAATCA
ACTTATTGGT TAATATTTGA GTTAATATGG CGAGATTACT TCCGATTTAT TTGTCAAAAA
CATGGGAATA AAATATTTCA TAAATCTGGT TTACAAGGTA TAGCTATTCC TTGGCAAGAG
GATTGGGAAA AATTTAGAAA GTGGCAAGCA GGTCAAACAG GATTTCCTCT AGTAGATGCT
AATATGCGTG AACTTTTAGC TACAGGTTTT ATGTCAAATC GTGGTCGACA AAATGTTGCT
AGTTTTCTTA CTAAAAATTT AGGAATTAAT TGGCAAATGG GAGCTGAATG GTTTGAGTCT
TTACTAATAG ATTATGATGT TTGTAGTAAT TGGGGAAATT GGAATTACAC TGCTGGAGTA
GGAAATGATG GGCGAGGTTT TCGCTATTTT AATATTCCTA AACAAGCAAA AGATTACGAC
CCAGAAGGAA AATATGTTAA ACATTGGTTA CCAGAATTAG GAAAAGTTCC TCCTGCTAAA
GTACATGAAC CTTGGAAATT ATTACCTGTT GAACAAGATA GATTTGGTGT CAAAATAGGG
GTAGATTATC CAGAACCAAT TATTGATTTA TGGCAGTCAG TTAAGGAGAA TGAAAAAAAA
TATAATAGAG CGTTACAAAT GACACTAGGT AAAATAAATA AAAGAGGCAG ATTTTGA
 
Protein sequence
MSQNVIILWY RNDLRIHDHE PLYKALKVNA QIIPIYCLDP RQFSQTDFGF PKTGVFRAKF 
LLESIADLRN NLQKLGSNLV IFQDKPEIVI PRLAQQLSAK SVFFHQEVTE LEVKVERLVH
QALKQIGVRL KSFWGHTLYH PDDLPFEIKQ LPELFTTFRK DVEKNSSVNP TFSIPKKLSS
LPKIDVGELP TLSDLNLEKP PLNSQGVLEF KGGETAAKER VKNYFWQQDY LKVYKETRNG
MLGANYSSKF SPWLALGCLS PRYIYEEVKE YEYQRVKNQS TYWLIFELIW RDYFRFICQK
HGNKIFHKSG LQGIAIPWQE DWEKFRKWQA GQTGFPLVDA NMRELLATGF MSNRGRQNVA
SFLTKNLGIN WQMGAEWFES LLIDYDVCSN WGNWNYTAGV GNDGRGFRYF NIPKQAKDYD
PEGKYVKHWL PELGKVPPAK VHEPWKLLPV EQDRFGVKIG VDYPEPIIDL WQSVKENEKK
YNRALQMTLG KINKRGRF
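
The sequences above are printed in blocks of ten characters. A minimal sketch for working with them: strip the whitespace, compute GC content, and translate the coding sequence. Translation table 11 assigns the same amino acids to codons as the standard genetic code, which is what the compact table below encodes. The helper names are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAGTCAAA ACGTAATTAT ATTATGGTAT CGTAATGACC TACGCATCCA TGACCATGAA"
cds = clean(first_line)
print(translate(cds))  # MSQNVIILWYRNDLRIHDHE
print(f"GC content of this fragment: {gc_content(cds):.0%}")
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would give the whole-gene GC content and the complete translation.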