Gene Tery_4232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4232 
Symbol 
ID4245884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6533207 
End bp6535552 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content35% 
IMG OID638109128 
ProductATP-dependent DNA helicase Rep 
Protein accessionYP_723706 
Protein GI113477645 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID[TIGR01073] ATP-dependent DNA helicase PcrA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.491131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.505763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAGCA CTGACTTTCT GACTCAGCTA AATTTATCCC AACGCCAAGC AGTAGAACAT 
TTCTGCGGAC CTATGTTGGT CGTGGCGGGT GCTGGGTCTG GCAAAACTAG GGCGTTGACT
TATCGTGTAG TCCATCTAAT ACGTCATCAT CGGGTACATC CAGAAAATAT TCTGGCGGTT
ACCTTTACAA ATAAAGCTGC ACAGGAAATG AAGGACCGCA TTGAAAAAGT ATTTGCTCAA
GAACAAGCGG AAGCTAAATA TAACAAACCA TTTTCAGCAT TGACATCAGA AGAACAAATC
AGGTTGCGAT CGCAAGTCTA CAAAAATATC ACTAAACATT TATGGGTGGG AACTTTTCAT
AATCTTTGCG CTCGTATTCT GAGGTTTGAT ATTAACAAAT ATCAGGATGA AAAAAAACGT
CATTGGGATA AAAATTTTTC TATATTTGAT GAAAGTGATG CTCAAAGTTT AATTAAGCAA
ATTGTCACTA AACAGCTAAA TTTGGATGAT AAAAAATTTG AACCACGTTC TGTGAGATAT
GCCATTAGTA ATGCTAAAAA CCAAGGAATG TCACCTCTAG AATATCAAAG AGCCGAGCCA
GATTATCGGG GACGGGTAAT TGCTGAAGTT TATGGAATTT ATCAAGATAA TTTAGCTGCG
AACAATGCTC TTGATTTTGA CGACCTAATC AGAATACCAG TAGAATTATT TCGGCAAAAT
GAGCAAATAT TAGCTTATTG GTATCAACGT TTTAATCATA TTTTAGTAGA TGAATATCAA
GATACTAATC GGACTCAATA TAATTTTATT AGATTTTTAG CTACTAATGG TGAAGACCCT
AAATATATTA AAAACTGGGA AAATCGTTCT ATTTTTGTGG TAGGAGATGT AGACCAATCT
ATTTATTCTT TCCGCATGGC AGATTATACA ATTTTGCTTG ATTTTCAGAA TGATTTTGGC
GATGGTTTAG CTGATGAATA TACTCAGACA ATGATTAAGT TGGAGGAAAA TTATCGCTCA
CGGGAAAATA TTTTGGCAGT GGCAAATAAG TTGATTGAAA ATAATACTCA ACGTATTGAC
AAAACTCTCA AACCAACCAG GGGCATAGGG GAAGAAATTT ATTGTTATGA GGCAGAAAAT
GAGTTAGAAG AAGCAGAATT TATTTGTAGT AAAATTGCAG AAATTACAGA CCAATATCCA
GACTTAGATT TAGGAAGCTT CGCTGTACTT TATCGAACAA ACTCTCAGTC ACGCTCCCTC
GAAGAAAAGT TGATTCATTA TGATATTAAA TATGTCATAA TTGGGGGATT GAGATTTTAT
GATCGGAAAG AAATTAAGGA TGCTTTAGCT TATTTACGAG TCATTGCAAA TCCTGCTGAT
ACTGTTAGTT TACTCAGAAT TATTAACACT CCGAGAAGGG GTATTGGCAA GGCAACTATT
GATAGTTTAT TAAATGCTAG TTCCCAGATG GGAATACCTT TGTGGGAAAT TATTAATGAT
CAAGCTTCAG TAAATGCTTT GGCAGGTCGT TCATCAAAAG CTGTAAATAA GTTTGCCGAA
GTAATTCAAC ATTTGCAAGA TGAGTTAGAA AATTTGACTG CTCTGGAAAT TGTTGAGAGA
ATTTTAGAAA ATTCTGGTTA TATTGAAAAC TTGAAAAAAC AGGATACGGA AGATGCAGAT
AATCGACTGG CAAATTTAGG AGAATTATGT AGTGCTGTAG CTCAGTTTCA AGAAGATAAT
GAAGATACAA CTTTAGGAAG TTTTTTAGCA AATGCTTCTT TAGCTTCTAA TTTAGATAAC
CTTCAAGATG GGCAGGAAGC TGTGTCTTTA ATGACTTTAC ATTCTGCTAA AGGTTTAGAA
TTTCCCGTAG TATTTATAGT GGGTTTGGAG CAAGGTTTAT TACCTCATTT TCGGAGCATA
AATGACCCTT TATCTTTAGA AGAAGAAAGG CGACTTTGTT ATGTAGGTAT TACTAGAGCC
GAAGAACAAT TATTTTTTTC TTATGCAACT GAAAGACGAC AGCTTTGGGG AGCAAGAGAT
GCCACAGTTC CATCTCAATT TTTGGGAGAA TTACCAAGAG ATTTGATTAA TACTAATGGG
ATGAAAAAAG TAATTTATCC GTCAAAGCAT CAAAGGAAAA ACACAAAAAA TACTGTTGGA
AAAAAATCGG TAAGTAATCA AATAAAAAGT TGGCAAGTCG GGGATAAAGT GATGCATGAA
AGTTTTGGTG TGGGGTTAGT AACAAACATT CTGGGTGAAG GACATAAGAT GAGTTTAGGT
ATCAAATTTG GTAAGAGTAA AAAAATTATT GATCCCAAAA CGCCATCAAT AGAAAAGTTG
AATTAA
 
Protein sequence
MPSTDFLTQL NLSQRQAVEH FCGPMLVVAG AGSGKTRALT YRVVHLIRHH RVHPENILAV 
TFTNKAAQEM KDRIEKVFAQ EQAEAKYNKP FSALTSEEQI RLRSQVYKNI TKHLWVGTFH
NLCARILRFD INKYQDEKKR HWDKNFSIFD ESDAQSLIKQ IVTKQLNLDD KKFEPRSVRY
AISNAKNQGM SPLEYQRAEP DYRGRVIAEV YGIYQDNLAA NNALDFDDLI RIPVELFRQN
EQILAYWYQR FNHILVDEYQ DTNRTQYNFI RFLATNGEDP KYIKNWENRS IFVVGDVDQS
IYSFRMADYT ILLDFQNDFG DGLADEYTQT MIKLEENYRS RENILAVANK LIENNTQRID
KTLKPTRGIG EEIYCYEAEN ELEEAEFICS KIAEITDQYP DLDLGSFAVL YRTNSQSRSL
EEKLIHYDIK YVIIGGLRFY DRKEIKDALA YLRVIANPAD TVSLLRIINT PRRGIGKATI
DSLLNASSQM GIPLWEIIND QASVNALAGR SSKAVNKFAE VIQHLQDELE NLTALEIVER
ILENSGYIEN LKKQDTEDAD NRLANLGELC SAVAQFQEDN EDTTLGSFLA NASLASNLDN
LQDGQEAVSL MTLHSAKGLE FPVVFIVGLE QGLLPHFRSI NDPLSLEEER RLCYVGITRA
EEQLFFSYAT ERRQLWGARD ATVPSQFLGE LPRDLINTNG MKKVIYPSKH QRKNTKNTVG
KKSVSNQIKS WQVGDKVMHE SFGVGLVTNI LGEGHKMSLG IKFGKSKKII DPKTPSIEKL
N