Gene Tery_1671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1671 
Symbol 
ID4242826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2542658 
End bp2546200 
Gene Length3543 bp 
Protein Length1180 aa 
Translation table11 
GC content42% 
IMG OID638106806 
Producttranscription-repair coupling factor 
Protein accessionYP_721415 
Protein GI113475354 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTAT CATCTATTAT TAGGCTACTG GGGCGATCGC CTCTCACAAC CGAACTCCTA 
AACAAACTCC AACAATACCA ATGCTTAGGA CTCAACGGCG CTTCCCGTTT ACCTAAAGGT
TTACTCACAT CCACCCTCGC ACAACAACAA GAGCAAAATT TGCTGATAGT TACTGCAACT
TTAGAAGAGG CGGGCCGATG GGCGGTACAG TTAGAAGCAA TAGGATGGCC TAAAGTTCAC
TTTTACCCTA CTTCAGAAGC ATTCCCTTAT GAACCTTCCA ACTCAGAAGA AATGATTTGG
GGACAAATGC AGGTTTTAGC AGACTTAATA AGTGAGAATG TCCCAGAAAA TGTAACCAGC
AACGGCAACA GTAGCAATAA AGCAAAAATG GCCGTAGTAG CAACAGAGCG ATCGCTCCAA
CCTCACCTAC CTACAGTTAA AAAATTTCAA CCCTACTGCC TGACCCTTAT GGCCAGCATT
GCTAGTAGCT CAAAAACCTT AGGCTCAACT AAGATATCAA ACACAGATAT AGAGATAAAT
CATTCTGAGT CTGATAGTTT AGAGCGTCCA TCTTTAGAGG AAAAACTTGT CCAGATGGGA
TACGAACTAG TACCATTAGT CGAAACTGAG GGTCAGTGGA GTCGGCGAGG AGATATTATA
GATGTATTTC CCGTGGCCTC AGAATTACCG GTACGCCTAG AATGGTTTGG GGACGAACTC
AGACAAATTC GGGAATTTGA CCCTTCTACC CAACGTTCTC TTGACAAAAT TGCCCAAATA
GTATTAACCC CTAGAGATTT TAGTCAGTGG GCGTTAGGAA ATCTAGAGTC AATAACTGAC
TCAACATTTG TTTCCCTATT AGACTATCTA CCAGAAAATA CTCTGATTGC CATAGACGAA
CCAGATCAAT GTGCTGCCCA TGGCGATCGG TGGTTTGAAC TGGTAGAAGA AAACTGGCAA
CAACTCAAAA GATCTCAAGC ATTACCAAAA ATTCATCGTA CATTTGCAGA CTCCCTGGCA
GAAGCCGAAC TTTTTCCAAG ACTATATTTA TCGGAACTCA CCAAAGAAAC CAAGAGTGAT
ATTACCACCT CAGTTCCCTA TACCATTAAC CTAGCTAGTC GCCCTTTACC AGTCATACCA
CACCAATACG CCAAACTAGC AGAAAACCTG CGCCAACAAA GAGAACGTAA ACATTCCGTT
TTCCTAATTT CGGCTCAACC ATCACGCTCT GTCTCTTTAC TTCAGGAACA CGACTGTCCT
GCTCAGTTTA TTCCCAACTC CCACGACTAC CCTGCCATTG ACAAACTGCA AACCCAATAT
ACCCCTGTTG CTCTCAAATA TAGTGGTCTA GCAGAATTAG AAGGATTTAT TTTGCCCACA
TTTCGGTTAT CAGTTATCAC TGACAGAGAA TTTTACGGAC AACATACCCT CGCTACTCCT
ACCTATATTC GTAAACGCCG TCAAGCGACC TCCAAACAGG TAAACCCCAA CAAACTGCAA
CCGGGAGATC ATGTAGTTCA CCGCCAACAT GGTATTGGCA AATTTGTGAA ATTAGAAAGT
CTGACTCTAA ATAATGAAAC TCGCGACTAT CTAACAATTC AATATGCTGA TGGTTTGCTT
AGAGTTGCTG CCGACCAACT CAGTTCCCTG TCGCGGTTGA GAAGCACTGA TCATAAAAAA
CCCCAACTCA ATAAACTGAC TGGCAAAACC TGGGAAAGTA CCAAAAATAA AGTTCGTAAG
TCAATCAAAA AATTAGCTGT TGACCTGCTG AAGCTTTATG CTCAACGGGC TCAACAAACA
GGTTACAGTT TCCCTCCCGA CACCCCGTGG CAGGAAGAAA TGGAAGATTC ATTTCCCTAT
CAACCTACCC CCGATCAGTT AAAAGCAACC CAAGATGTGA AAAGGGATAT GGAAAGTGAA
CGAGCAATGG ATCGTCTCGT TTGCGGTGAC GTTGGTTTCG GCAAAACAGA AGTCGCCATC
CGAGCAATTT TTAAAGCAGT TATCGCCGAA AAACAAGTAG CATTTCTTGC TCCGACAACA
GTTTTAACCC AACAACATTA TCACACTCTT AAAGAACGTT TTGCTCCCTA CCCTATAGAA
ATAGGTTTGC TCAACCGCTT CCGTACTCCT AACGAAAAGA AAGAAATTCA GCATCGGTTA
GCAACGGGGG AATTAGATAT TATTGTCGGT ACTCACTCAA TTTTAAGTAA GACGATCCAG
TTTCGGGAAT TAGGTTTGTT GGTAGTGGAT GAGGAACAAC GGTTTGGTGT TAACCAGAAG
GAAAAAATTA AGGCGCTCAA AGCCGAAGTG GATGTGCTGA CACTGACGGC AACGCCAATT
CCCAGAACAT TATATATGGC ACTTTCAGGA ATTCGGGAAA TGAGTGTGAT TACAACTCCA
CCACCTTTGC GTCGCCCTAT TAAAACTCAC CTTGCACCTT ATGATCTTGA AACTGCCCGT
ACAGCAATTC GCCAAGAATT AAACCGGGGA GGACAAGTCT TTTATGTGGT GCCACGTATT
GAAGGTATTG AAGAATTGGC GGGAAAATTG CGAGAAATGA TTCCAGGAGC TAGGATTAAT
ATTGGTCACG GTAAAATGGA TGCAGCAGAG TTAGAGTCAA TTATGCTGAC TTTTAGTGCG
GGGGAAGCTG ATATTTTAGT TTGTACTACA ATTATTGAAT CTGGTTTAGA TATTCCCCGA
GTTAATACTA TTTTAATAGA AGATGCTCAA AAATTTGGCT TGTCTCAGTT GTATCAGCTA
CGGGGAAGAG TAGGTAGGGC AGGGGTGCAA GCTCATGCAT GGTTGTTTTA TCCTACCACG
AGCTCCGGTG GAATTGCACT AACGGATGAT GCACAAAAAC GGTTGCGAGC AATTCAGGAA
TTTACTCAGT TGGGTTCAGG ATATCATTTG GCAATACGAG ATTTAGAGAT TCGGGGAGCA
GGGGATATTT TGGGAGCAGA GCAGTCTGGT CAGGTGAACG CTATTGGATT TGATCTGTAT
ACGGAAATGT TAGAAGAGGC AATTCGGGAA ATCAAAGGTC AGGAAATTCC TCAAGTTGAT
GATACGAAAA TTGACCTAAG TTTGACTGCA TTTATTCCTG CAGATTATAT TTTAGATTTA
GATCAAAAAA TTAGTGCCTA TCGCTCGGTT GCTGCTGCTA ATACTAGAGA AGAATTGAGT
CAAATAGAAG TTGATTGGAG CGATCGCTAC GGTGCAATTC CAAAAGCTGG TTTACAGTTA
TTAAGGATGA TGGAATTAAA GCAAGTTGCT AAAAAAATAG GCTTTTCTCG GATTAAAGTA
GAAGGTAAAC AACACGTTAT TTTAGAAACA CCAATGGAGG AACCTGGATG GAATTTATTA
AAAGAGAAAT TACCAGGTCA TTTACAATCT CGTTTTGTTT TTAGTAAAGG TAAAGTTATA
GTGCGTGGTT TAGGTGTTTT AAGTGCAGAT AAACAGCTAG AAAGTTTAAT AGAATGGTTA
AGTAAAATGG AGGGTGCAAT ACCAAAAAGT CAATTTAGGG AACAGGGAGA AGGTGGATCT
TGA
 
Protein sequence
MTLSSIIRLL GRSPLTTELL NKLQQYQCLG LNGASRLPKG LLTSTLAQQQ EQNLLIVTAT 
LEEAGRWAVQ LEAIGWPKVH FYPTSEAFPY EPSNSEEMIW GQMQVLADLI SENVPENVTS
NGNSSNKAKM AVVATERSLQ PHLPTVKKFQ PYCLTLMASI ASSSKTLGST KISNTDIEIN
HSESDSLERP SLEEKLVQMG YELVPLVETE GQWSRRGDII DVFPVASELP VRLEWFGDEL
RQIREFDPST QRSLDKIAQI VLTPRDFSQW ALGNLESITD STFVSLLDYL PENTLIAIDE
PDQCAAHGDR WFELVEENWQ QLKRSQALPK IHRTFADSLA EAELFPRLYL SELTKETKSD
ITTSVPYTIN LASRPLPVIP HQYAKLAENL RQQRERKHSV FLISAQPSRS VSLLQEHDCP
AQFIPNSHDY PAIDKLQTQY TPVALKYSGL AELEGFILPT FRLSVITDRE FYGQHTLATP
TYIRKRRQAT SKQVNPNKLQ PGDHVVHRQH GIGKFVKLES LTLNNETRDY LTIQYADGLL
RVAADQLSSL SRLRSTDHKK PQLNKLTGKT WESTKNKVRK SIKKLAVDLL KLYAQRAQQT
GYSFPPDTPW QEEMEDSFPY QPTPDQLKAT QDVKRDMESE RAMDRLVCGD VGFGKTEVAI
RAIFKAVIAE KQVAFLAPTT VLTQQHYHTL KERFAPYPIE IGLLNRFRTP NEKKEIQHRL
ATGELDIIVG THSILSKTIQ FRELGLLVVD EEQRFGVNQK EKIKALKAEV DVLTLTATPI
PRTLYMALSG IREMSVITTP PPLRRPIKTH LAPYDLETAR TAIRQELNRG GQVFYVVPRI
EGIEELAGKL REMIPGARIN IGHGKMDAAE LESIMLTFSA GEADILVCTT IIESGLDIPR
VNTILIEDAQ KFGLSQLYQL RGRVGRAGVQ AHAWLFYPTT SSGGIALTDD AQKRLRAIQE
FTQLGSGYHL AIRDLEIRGA GDILGAEQSG QVNAIGFDLY TEMLEEAIRE IKGQEIPQVD
DTKIDLSLTA FIPADYILDL DQKISAYRSV AAANTREELS QIEVDWSDRY GAIPKAGLQL
LRMMELKQVA KKIGFSRIKV EGKQHVILET PMEEPGWNLL KEKLPGHLQS RFVFSKGKVI
VRGLGVLSAD KQLESLIEWL SKMEGAIPKS QFREQGEGGS