Gene Tery_3877 details


Gene Information

Locus tag: Tery_3877
Symbol:
ID: 4243540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5995638
End bp: 5998688
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 37%
IMG OID: 638108806
Product: exonuclease SbcC
Protein accession: YP_723388
Protein GI: 113477327
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00618] exonuclease SbcC
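
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields for Tery_3877.
START_BP = 5995638
END_BP = 5998688


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS that ends with one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "3051 1016"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.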


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.371142
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0827458
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCCTC AAAAACTACA ACTCAAAAAT TTTCTCAGCT ATCACCAAGC GACTTTAGAT 
TTCACAGGTC TGCACACTGC CTGCATTTGC GGACCTAACG GTGCTGGAAA ATCTTCCCTA
CTAGAAGCGA TCGCTTGGGC AATATGGGGA AATAGTCGGG CTGCCACTGA AGATGATATT
ATCAGTCTGG GAGAAAAAGA AACACGGGTA GACTTTACAT TCTCCACTCA CGGTAATATT
TACCGTGTTA TTCGTGGTCG TCGTCGCGGT CAGTCACCCA CCCTAGAATT TCAAGTTAAT
ACAGGTTCAC AGTTTAAAAG TCTAACTCAA AAAGGTGTCC GAGCAACCCA ACAAAGCATT
ATCGAATATA TCAAGCTTGA CTATGAGACA TTCGTAAATT CAGCATATTT ACGTCAAGGT
AGGGCAGATG AGTTTATGCT TAAACGCCCA AATGAACGAA AAGAAATTTT GGCAAGTCTC
CTAAAACTTG ACCAATATGA CACCCTGGGA GAAAAAGCAA AAGATATTGC TCGGCAGTTT
AAAGCAAAAG TCGAAATACT TGAACAAAGT CTAGAAGCGA ACGAAAAACA GCTAGAACAG
AAAGAAGCGA TCGCTCAACA ACAAAATAAC CTGAAAGCAA TTTTAGCTCA ACTGCAACAA
AAACAAGAAA CTGACAGACA ACAACTCAAA GAACTTCAAA CCAAACAACA TGAGCGCCAA
ACCTGGCAAA AATTACTGCA AGGTCAACAA CAACAATATG ACAAACTTGT GCAAGAATGT
TCTCGTCTCA AACAGGAACT AAAAGCTACT CAACAACAAC AAGATGAACT TATAGCAGTA
TTGCAACAAG AAAATGAAAT CAATGCTGGA TATGTTCATT TCCAAAACTT GCAAGCTACT
GAAGAAAGTT TATCTGCTAA GTTCAAAAAA CACCAGAATG CTCTCAAGCA AAGTCAACAA
CTCCAGGAAA AACAACGGCA AGAACTCCAG AAATTAGAAA ACAACATTCA GCAATTTCGA
GGGCAGTTAA ATTCCCTCAA AGAGCAGGAA CAAGAAAATC TAAATGTTCT AAGTAAGCGT
ACCGATATTG AAACGGCTTT AGCAGAACTA CAAGCAGCAA GGGTCAAATT AAATGAGCTG
GATAAGTTAC AAGTAGAAGT TACTCCTCTG GTGCAACAAC GCCAGAAATT ACAAACCGAA
CTCGAAAAGA GGCAGGCACG TTTGAGCGTA AGGTTGGAAG AACTTAATTC TCGAGTTTGT
CAACTACGAC AAACTCAACA GCAAAAGCAA CCACAACTAC AGGCAACTTT GCAGGAGGTT
GTAGCAGAAA TTGCTGTACT GGATAATAAG CGGGTTTATC AAGAACGGGT CAGGGAAAAA
GGGCAGGAAA GACATCAGTT TCTGGAACGT TTAATTGGCA ATAAAAAAGA TTATGAAACC
AGACTGGCAG AGTTGGGGCA AAAGTTACAG TTATTAAATA TAGGTAGTAA TAATGGTAAA
GTAGAAGTTA AAGAATATCC ACCTTGTCCT TTATGCGATC GCCCCTTGGA TGAACATCAC
TGGCATTTGG TGGTAGATAA ACATCAAACT CAGCAAAAGG AAATTAGAGA TCTGTTGTGG
GTGACACGGG AGCAACTGAC TATTTCTGAA CGAGAAATTC AGGTTTTAAG AACAGAATAT
CGGCAAATAG ACGAGGAGTT AGGTAAATAT GATCAGTTGC GGGAGAGCCG GGGTGGTTTA
CAGGTACAGT TGGATAATAT TAAACAGGAT GAAAGTTTGG TGCAAGAATT AGTAGAGGAA
GTGGCAAAAG TTGAGCGATC GCTACATACG GGTGATTTTG CTGTGAATTT ATATGCTGAA
CTTAGCACTT TAGAGCAGCA AATTCAACAA TTTAATTATG ATGAGCGCAG TCATAGTTTA
GCGAGGGAAG AGGAAAAAAA ATTAAGATGG GCAGAAATTA AGTATGGGCA AATTAAAGAT
GCAGAGATAA AGTTAGGAAA AGTTCGCGCT CTGATGCCAG AAATTGAAGA AAAAATAGTT
GGCTTAAAAC AAGATTTGGA GCTACAAAAA CAAAGTTCTC AAATACAACA AGAAATTGTG
GTTATTGAGA GAGAAATTGC TGAAATTGGG TATGATTTAG AGTTACATAA TAATGTGAGA
TTGGAGTTGA GAAAAGCACA GTTTTGGCTA ACTAAAATTG AAAAATTACG TCAAGCTCAT
AAACAATATC CTTACTTAAT TGAAAGGATA AATGAGTTAA AGGCAGTTAT AGAAAATCGA
CAGCAAGATT TAACAGATAT TAAAAGTCAA GTTTCTATTT TAAATCAACA GTTAGAAAAA
ACTCCAGAGT TGGATGAAAA AATTATCTAT TTAGACAAAA AAATTCAAGT TCGTCGCCAG
GAATTAGATG AAACATTGGG AAGTTTAGGC TCATTGGAAC AGCAGTTAAA ACATTTAGAA
AATTTGCAAC TTCAAAACGG TAAACAATTA GAAGAATTAC AAACTACGAA ACGGCAATAT
AGAGTTTATC AGGAGTTAAG TTTAGCTTTT GGGAAAAAGG GAATTCAGGC ATTAATGATT
GAAAATATTT TGCCTCAGTT AGAGGCAGAA ACTAATCAAA TTTTGGCGAG ATTAAGTGCG
AATCAACTGC ATATTCAATT TGTGACGCAA AAAGCTACAA AAGGAAGTAA GAAAAATGCT
AGATGGATAG ATACATTAGA TATTTTAATT GCAGATGCCA AAGGAACAAG ACCTTATGAA
ACTTATTCGG GAGGAGAAGC TTTTAGAGTT AATTTTGCAA TTCGTTTAGC TTTATCAAAA
TTGTTGGCAC AACGTTCAGG CACTGCTTTA CAAATGTTGA TTATTGATGA AGGTTTTGGA
ACTCAAGATG CTGAAGGATG TGACAGATTA ATTGCGGCTA TTAATGCTAT TGCTACTGAT
TTTTCTTGTA TTTTAACAGT GACTCATATT CCTCATTTTA AAGAGGCTTT TCAGGCTAGA
ATTGAGGTGA GTAAAACTGC CCAAGGTTCA CAAATTATTT TGTCTGTATG A
 
Protein sequence
MIPQKLQLKN FLSYHQATLD FTGLHTACIC GPNGAGKSSL LEAIAWAIWG NSRAATEDDI 
ISLGEKETRV DFTFSTHGNI YRVIRGRRRG QSPTLEFQVN TGSQFKSLTQ KGVRATQQSI
IEYIKLDYET FVNSAYLRQG RADEFMLKRP NERKEILASL LKLDQYDTLG EKAKDIARQF
KAKVEILEQS LEANEKQLEQ KEAIAQQQNN LKAILAQLQQ KQETDRQQLK ELQTKQHERQ
TWQKLLQGQQ QQYDKLVQEC SRLKQELKAT QQQQDELIAV LQQENEINAG YVHFQNLQAT
EESLSAKFKK HQNALKQSQQ LQEKQRQELQ KLENNIQQFR GQLNSLKEQE QENLNVLSKR
TDIETALAEL QAARVKLNEL DKLQVEVTPL VQQRQKLQTE LEKRQARLSV RLEELNSRVC
QLRQTQQQKQ PQLQATLQEV VAEIAVLDNK RVYQERVREK GQERHQFLER LIGNKKDYET
RLAELGQKLQ LLNIGSNNGK VEVKEYPPCP LCDRPLDEHH WHLVVDKHQT QQKEIRDLLW
VTREQLTISE REIQVLRTEY RQIDEELGKY DQLRESRGGL QVQLDNIKQD ESLVQELVEE
VAKVERSLHT GDFAVNLYAE LSTLEQQIQQ FNYDERSHSL AREEEKKLRW AEIKYGQIKD
AEIKLGKVRA LMPEIEEKIV GLKQDLELQK QSSQIQQEIV VIEREIAEIG YDLELHNNVR
LELRKAQFWL TKIEKLRQAH KQYPYLIERI NELKAVIENR QQDLTDIKSQ VSILNQQLEK
TPELDEKIIY LDKKIQVRRQ ELDETLGSLG SLEQQLKHLE NLQLQNGKQL EELQTTKRQY
RVYQELSLAF GKKGIQALMI ENILPQLEAE TNQILARLSA NQLHIQFVTQ KATKGSKKNA
RWIDTLDILI ADAKGTRPYE TYSGGEAFRV NFAIRLALSK LLAQRSGTAL QMLIIDEGFG
TQDAEGCDRL IAAINAIATD FSCILTVTHI PHFKEAFQAR IEVSKTAQGS QIILSV
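
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the amino-acid assignments of NCBI translation table 11 (which match the standard code and differ only in permitted start codons), and the helper names `clean`, `gc_percent`, and `translate` are hypothetical. Only the first 60 nt of the gene are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
# Amino-acid assignments assumed for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-character block layout."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG or TTG) would need to be
    read as M at position 1; this gene starts with ATG, so that case is
    not handled here.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence from the record.
gene_start = clean("ATGATTCCTC AAAAACTACA ACTCAAAAAT TTTCTCAGCT ATCACCAAGC GACTTTAGAT")
protein_start = clean("MIPQKLQLKN FLSYHQATLD")

print(translate(gene_start) == protein_start)  # prints "True"
print(round(gc_percent(gene_start), 1))        # GC percentage of these 60 nt only
```

Running `gc_percent` on the complete cleaned gene sequence gives the value to compare against the GC content field, and `translate` on the complete sequence gives the string to compare against the full protein sequence.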