Gene Tery_4471 details


Gene Information

Locus tag: Tery_4471
Symbol:
ID: 4246124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6897392
End bp: 6900367
Gene Length: 2976 bp
Protein Length: 991 aa
Translation table: 11
GC content: 34%
IMG OID: 638109354
Product: hypothetical protein
Protein accession: YP_723931
Protein GI: 113477870
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
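
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6897392
end_bp = 6900367
gene_length_bp = 2976
protein_length_aa = 991

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 2976 0 991
```

Under those assumptions the span matches the listed gene length, is a whole number of codons, and gives the listed protein length.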


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.756132
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0586303
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCCG ATATTACTGA AAAAGGTCTC GAAAATATTA TCTATCAAAG TCTCATCGAC 
GACTGCCAAT ATTTAGAAGG TAACCCCAAA GACTACGACC AAACCTACTG TATCGACACT
GAGAAACTTT TCCAGTTTCT CCAAAACACC CAACCTGAAA AATTAACAGA AATTTCTAAC
TACCACGGCG CCAACTGGGA GAAAAAACTT TATGAACGCC TCCACCACCA AATAGAAGAG
AAAAGTATAG TCAATATATT ACGTCAAGGT ATCAAAACTG GAGAAACCCA CCTCGAACTT
TACTATAAAC TTCCCACTTC CCAACTCAAC CCCGACACTA TCGAAAATTT CCAAGAAAAC
GTTTTTTCAG TCACTCGCCA ACTAAAATAC AAAGAAAACC GTAACTTCTC CCTCGACTTA
GTAATTTTTA TAAACGGTTT ACCAGTTATT ACCTTCGAGC TAAAAAACCA ACTAACCAAA
CAAAACTTTC GAGACGCCAT AAACCAATAT AAAAATGACC GACGCCCCAG AGAATTATTA
TTTCAATTCA AACGTTGCCT AGTACATTTT GCCCTCGACG CCGATGAAGT TTGGATGACA
ACAAAACTCA ACGGCAAAAA TACAGAATTC ATACCATTCA ACAAAGGCAA AAAATCAAAT
CCTGATCTAC CTTTTCCGGA TACAGCAGGG AACCCTCCCA ACCCCAACCA CATCAAAACA
GATTATTTGT GGAAAGAAAT TTTAACCATA GAAAGTCTCG GAAACATCAT CGAACATTAC
GCCCAACTGA TAGAAAAAGA AGAAGATAAA GACAAAGACA AAAAAACAGT CAAAAAGCTA
AAACTAATCT TCCCCCGCTA CCATCAACTC GACCTAGTCA AGCAACTTTT AACAAGTGCA
AAAAAACATG GAGTCGGCAA CCGCTACTTA ATCCAACATT CTGCAGGTTC CGGCAAAAGT
AATTCTATAA CCTGGCTGAG TCATCAACTC GTAGAACTGA AAAATATTAC CGAGAAAGAA
AATATTTTTG ATTCAGTTTT AGTCGTGACA GACCGCAAAA TTTTAGATAA ACAAATTAGG
GAAAATATTC AACAATTCGC TCAAGAAGAC AAAGTCGTAG AAGCAACCAA AAACAGCAAA
AAATTAAAAT CAGCCTTGGA AAACAAACGG AAAATTATTA TTACAACAGT GCAAAAATTT
CCATATGTTG TCAAAGAAAT TCAATCTTTA TCCGATCACA AGTTTGCCAT TATTATCGAC
GAAGCACATT CGAGTCAAAC TGGCAAAAGT GCAGCCAGCA TGAGTGAATC TTTGAGCAAA
AAAGATTCGG AAGTAGAAGA AACCACAGAG GATAAAATAA TCCGAATTAT TGAGTCACAA
AAACTTTGCC CAAATGCTAA TTATTATGCA TTTACTGCCA CGCCAAAGAA TAAAACTTTA
GAGTTATTTG GTGTCAAAAA TCCAGAAGAT GGAAAATTTT ATCCGTTCCA TAGTTATTCC
ATGAAGCAGG CAATTGAAGA AGGATTTATT CTGAATGTTT TGCAGCATTA TACGACCTAC
AAAACCTATT GTCGATTAGA GAAGAAAGTT ATAGACGACC CTGAATTTGA TAGTAAACAA
GCAAAAAAGA AGTTAAAACA ATATGTAGAA GAGGATCAAG AGAGTATCCG CAAAAAGTCA
GAAGTGATGA TTGAGCATTT TTTATCAAAG GTAATTGCTC AGGGAAAAAT TAATGGAAAA
GCTAAGGCTA TGGTCGTTAG TAATAGTATT AAAAGTGCGA TTTATTATAA AAAAGCTTTT
GATAAATATT TGAGAGAAAA AAAATCTGAT TATCAGACTA TTGTTGCTTT TTCTGGAAGT
AAAGAAATAG ACGGCAAAAA GGAAAATGAG TCTTCTATGA ATGGATTTTC TAGTAGTAAG
ATTACAGAAA AATTTAATGA TAGTAAATAT AGGTTTTTAA TTGTGGCTAA TAAGTATCAA
ACTGGTTTTG ATGAACCGTT GTTACATACT ATGTATGTGG ATAAAGTTTT ATCTGATGTG
AAAGCAGTAC AAACTTTGTC TAGGTTAAAC CGTTCTTGTG AGGGAAAAAC AGATACTTTT
GTTTTAGATT TTGTTAATTC TGCTGATGAA ATTCAGAGAG CTTTTGAACC TTATTATAAA
ACAACTATTT TGAGTGAAGA AACAGATAGC GATCGCCTCT ATGATTTAGA GGATAGTTTA
GCAAGTTTTC AGATTTATTC TCAAGAAAAT GTAGAGAAAT TTATGAAGCT TTTTTTGAAT
TGTGAGTCAC GGGAAAATTG GGAGTCAATT TTAGATATTT GTGTGGAAAA ATATAATTGT
GATTTGCTAG AGGAGGAAAA AATAGAGTTT AAAAGTAAAG CCAGGAGTTT TGTGAAAAAT
TATCAATTTT TGGTGCAAGT AAAAAGTTTT AAAAATTCCA ATTGGGAGAG TTTAAATAGT
TTTCTGAAAT TGTTAGTTAA TAAACTGCCA CAATTAGATA ATTCTGATTT ATCGGCAGGA
ATTATTAATA GTGTGGATAT TGAGAGTTAT CGAGTAGAGC TTCTAGCTAG TCAAAGTATT
AATTTAAGTG GAGAAAATAC CCTATCTCCC ATTGCCAAGA ATATTGTTAG TGGAAATTCT
CAAAGTAGGT CAGATAAAGT TAGTCAAATA ATCGAAGAAT TTAATAACCG CTTCGGTGGT
AATATTGTTT GGCAAAATGA GGGTAGGGCA TGGAAATTTT TATTAGAGGA GTTGCCAGAA
AAAGTCAGAG GAAATGGGGA GTATAAAAAT GCTATAAATT ATAGCGATCC GCAAAATGCC
AAACTTACCT TTGAAAATAA ATTCAATCAA GAATTACGGC GTTCTACCCG TGAACATATA
GAAGAATATC GTCAATTTAC AGGTAATAAA AGTTTTCGAG AATGGTTAAT TAATACTTTA
TTTAATCTTG ACTACGAGCA AGATAAAAAT GCTTAG
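
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the spacing has to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC-percentage calculation, which could be used to compare against the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical and do not come from the source database. Only the first sequence line is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full block to check
# the whole gene.
first_line = "ATGGCTTCCG ATATTACTGA AAAAGGTCTC GAAAATATTA TCTATCAAAG TCTCATCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full 2976 bp block, `gc_percent` gives a value that can be compared against the rounded figure in the record.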
 
Protein sequence
MASDITEKGL ENIIYQSLID DCQYLEGNPK DYDQTYCIDT EKLFQFLQNT QPEKLTEISN 
YHGANWEKKL YERLHHQIEE KSIVNILRQG IKTGETHLEL YYKLPTSQLN PDTIENFQEN
VFSVTRQLKY KENRNFSLDL VIFINGLPVI TFELKNQLTK QNFRDAINQY KNDRRPRELL
FQFKRCLVHF ALDADEVWMT TKLNGKNTEF IPFNKGKKSN PDLPFPDTAG NPPNPNHIKT
DYLWKEILTI ESLGNIIEHY AQLIEKEEDK DKDKKTVKKL KLIFPRYHQL DLVKQLLTSA
KKHGVGNRYL IQHSAGSGKS NSITWLSHQL VELKNITEKE NIFDSVLVVT DRKILDKQIR
ENIQQFAQED KVVEATKNSK KLKSALENKR KIIITTVQKF PYVVKEIQSL SDHKFAIIID
EAHSSQTGKS AASMSESLSK KDSEVEETTE DKIIRIIESQ KLCPNANYYA FTATPKNKTL
ELFGVKNPED GKFYPFHSYS MKQAIEEGFI LNVLQHYTTY KTYCRLEKKV IDDPEFDSKQ
AKKKLKQYVE EDQESIRKKS EVMIEHFLSK VIAQGKINGK AKAMVVSNSI KSAIYYKKAF
DKYLREKKSD YQTIVAFSGS KEIDGKKENE SSMNGFSSSK ITEKFNDSKY RFLIVANKYQ
TGFDEPLLHT MYVDKVLSDV KAVQTLSRLN RSCEGKTDTF VLDFVNSADE IQRAFEPYYK
TTILSEETDS DRLYDLEDSL ASFQIYSQEN VEKFMKLFLN CESRENWESI LDICVEKYNC
DLLEEEKIEF KSKARSFVKN YQFLVQVKSF KNSNWESLNS FLKLLVNKLP QLDNSDLSAG
IINSVDIESY RVELLASQSI NLSGENTLSP IAKNIVSGNS QSRSDKVSQI IEEFNNRFGG
NIVWQNEGRA WKFLLEELPE KVRGNGEYKN AINYSDPQNA KLTFENKFNQ ELRRSTREHI
EEYRQFTGNK SFREWLINTL FNLDYEQDKN A
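
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. It assumes the gene starts with ATG, as this one does, and does not handle alternative start codons. The function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI "TCAG" order; amino-acid assignments of table 11
# are the same as those of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate(clean_sequence("ATGGCTTCCG ATATTACTGA AAAAGGTCTC")))  # MASDITEKGL
```

The output matches the first block of the protein sequence listed above.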