Gene Tery_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0644 
Symbol 
ID4242782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1051875 
End bp1053653 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content37% 
IMG OID638105945 
ProductDNA repair protein RecN 
Protein accessionYP_720558 
Protein GI113474497 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.659362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0659728 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTATAT CTCTTCGCAT TAATAACTTT GCCCTCATAG ACCATCTCGA ACTAAAATTA 
GGTTCCGGTC TAAATGTCTT CACAGGTGAA ACAGGTGCCG GAAAGTCCAT TATCTTAGAT
GCTCTTAATG CTGCCCTAGG TGCTAAGATA GACCGTCGTA TTATCAGAAC AGGAACTAGA
CGAGCTATTT TAGAAGCAAC CTTTAAAATT AATCCAGATC TAGCTGAATG GTTAAGAGGT
CAAGAAATAG ATCCGGTAGA TGATACATTA GTAATTTGTA GTCGCGAAAT TATTGTAACT
CAAGACTCCC TGCGCACAAG GTCTCGTGTA AATGGGGTTC TAGTTAACCG GAAAATAATA
GACCAACTTC GCGATCGCTT AGTAGAGATT ACAGCTCAAG GCCAGACTTT ACAATTGGGA
AATACTAATC TGCAAAGAGA ATGGCTTGAT TTATATGGTG GTTATTCCCT ACTCAAATGT
CGAGAAGCCG TAAATATTAG CTATCAAAAA GCTCAAAAAA CCAAAGTAGC GTTAGAAAAC
CGTCGCCAGT CAGAGCAGCA ACGTTTACAA AGAATAGACT TATTAGAATA CCAGGTACAA
GAATTAGATA AAGCTAATCT TACTGAGCCT CAAGAATTAG AGCAACTAAT ACAAGAAAGT
CAACGCCTTA GTCATGTAGT TGAACTGCAA CATCAAAGTT ATCAAATATA TCAAGCTTTA
TATCAAAATG AAGGTGATAA TTCAGCAGCA GCAGATCTTT TAGGAAAAGC AGAAACAATA
TTAAATGATA TGGTAAACTA CGACTGCACA TTACAATCAA TATCAGAAAT GATTAGTGAA
GCTCTAGCTC AAGTTGTTGA AGCTAGCCGA CAAATTAGCA GTTATGGTGA GCAATTAGAA
GCAGATCCTC AAAGATTACA GGATGTAGAA GAAAGAATTC AAGAACTTAA ACAAATTTGT
CGTAAGTATG GACCCACTCT AGAAGAAGGC ATTAATTATT ATCACAGAAT ACAAGCAGAA
CTTAAAGAAC TACTTGAAGG AGGAGAGTCC CTTGAAGCAC TAGAAAAAGT ATATCAAGAA
GATCAGATTG AATTACAAAA TAAATGTCAC GAACTTAGTT TGCAACGCCA TAGTGCTGCT
GCTAAATTGG AAACACATTT GGTAGAAGAA TTAAAGCCCC TTGCTATGGA GAAAGTGAAA
TTTAAGGTAC AAATTACTTC TATTACTCCG ACAATTACAG GTGCTGACCA CCTTACATTT
TGTTTTAGTT CTAATCCTGG TGAACCACTA CAACCTCTAA ATTTAACTGC TTCTGGAGGA
GAAATGAGTC GGTTTTTATT AGCACTGAAA GCTTGTTTTT CTCAAATTGA AGCATCAGAT
ACTCTAGTTT TTGATGAAAT AGATGTAGGA GTTTCAGGAA GAGTAGCAGG AGCTATTTCA
GAAAAGTTAC ACCAACTTAG TCGCCAACAT CAAGTTCTAT GTGTAACTCA CCAACCTATA
GTTGCTGCAA TGGCAGACCA TCATTTTAAT GTTAGCAAGC AAGTAATTCA GCAAGTAGCT
AATGTTGATT CAAATGATCA CAATTCTGAG GAGAGAACAA TAGTTAGAGT GAAAAGCCTG
GATAATTATC AACGACGAGA AGAGCTCGCA CAATTAGCTA GTGGTAGATC AGTTCAAGAA
GCTATTGCTT TTGCAGAATC TCTTTTAACT CAAGCCGCAA CTAAACGCGA GAAAAATTCG
ATTACTCCTA TTTCTAGTTC CAGTTTTGGA GGTGAATAG
 
Protein sequence
MLISLRINNF ALIDHLELKL GSGLNVFTGE TGAGKSIILD ALNAALGAKI DRRIIRTGTR 
RAILEATFKI NPDLAEWLRG QEIDPVDDTL VICSREIIVT QDSLRTRSRV NGVLVNRKII
DQLRDRLVEI TAQGQTLQLG NTNLQREWLD LYGGYSLLKC REAVNISYQK AQKTKVALEN
RRQSEQQRLQ RIDLLEYQVQ ELDKANLTEP QELEQLIQES QRLSHVVELQ HQSYQIYQAL
YQNEGDNSAA ADLLGKAETI LNDMVNYDCT LQSISEMISE ALAQVVEASR QISSYGEQLE
ADPQRLQDVE ERIQELKQIC RKYGPTLEEG INYYHRIQAE LKELLEGGES LEALEKVYQE
DQIELQNKCH ELSLQRHSAA AKLETHLVEE LKPLAMEKVK FKVQITSITP TITGADHLTF
CFSSNPGEPL QPLNLTASGG EMSRFLLALK ACFSQIEASD TLVFDEIDVG VSGRVAGAIS
EKLHQLSRQH QVLCVTHQPI VAAMADHHFN VSKQVIQQVA NVDSNDHNSE ERTIVRVKSL
DNYQRREELA QLASGRSVQE AIAFAESLLT QAATKREKNS ITPISSSSFG GE