Gene Tery_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1917 
Symbol 
ID4242666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2966800 
End bp2970414 
Gene Length3615 bp 
Protein Length1204 aa 
Translation table11 
GC content34% 
IMG OID638107038 
Productvon Willebrand factor, type A 
Protein accessionYP_721645 
Protein GI113475584 
COG category[R] General function prediction only 
COG ID[COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.137165 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAGTG AGTTTGAAAA GGTTCTAGAC GAGATTAGAA AACGAAGTAG TAAATATGAA 
GATATTGGAG ATGTTAAGAG GGAAATTATT AATGCGCTTT CACAAAGATA TAAAACACAG
GAACGTTTGT TTACACATTA TGTTCTAGAA AAAGGGGGAT TTGATATTCG TAAATTAACT
TTAAGTGACC CTCTGCTAGT AATTATATCT GATTTTTTTA ATCAGTGTAA TGGTCTAAAC
AAAAACGAAT TATATAAATT TTTTAGAGTG TTATATGATG AAAATAAAGG TATTAGTTTG
CTAAAAGAAA TTAGAGATAC ACTTGTACAA TTTAAACATC AAAGTACTGA TACTCCTACA
TATACACCAT CACCATACAT ATCAGTTATT ACTTCAGAAG CAATAGGCTT AGTAGAAGCA
AAATATGGAC AGTCTCTAGA ACAGGGAAAA AACTTTGCTA AAAAAGTTGA TTGTAATAAT
TTTCAGGATG TACAAGCTTT TTTTAATAAT GGGGGACAAG CTGGTAAACA GCTTGCTATT
TTGGAGCCAG GAACCTATTA CATTAACCCC GAAATATTCA CTATTCGTAC AGTTCCTATT
ATTAGGATTC TTCAAGGAGA AATAGGACTG GTAATAGCCA ATGAGGGGAC TTTTAAGTCT
GATGAACAAA CGTTAGGTAG AGTGGTGGAA TGTGATAACT TTCAGGATGC TGAAGCATTT
CTGAAAAATG GAGGACAAAA AGGTAAGCAA CTTGCTATTT TGACTGACGG TGACTATAAA
ATTAACACTG ATTTCTTTAG CGTAATTACT ACTACTAATG CCTATAAATA TAATGAAAAT
CCAAATAATT TAAAGGTTTA TAAAATCGAC AAAGATAAAA TTGGTATTTT AACTACGATG
GTTGGGAAAA TTCTCCCTAA AGGTGAAATT GCTGGACCAA TAATTGAAGG ACACGATAAT
TATCAAAATG CACAAAAATT CCTAGATTTA GGAGGATATA AAGGATTGCA AGAAGAAGTT
CTCCAAGAAG GAGCCTGGAG TCTAAATCCT TGGTTTGTTG AAGTTGAGCA GGTACCACTA
ACTAGAATTG AGCAAGAAGA GGTGGGAGTT GTTATATCTT TTGTTGGAAA AGAATATGAC
AAAAATTATG ATCAACAGCC CATTTTTTAT GGTGCAGAAA AGTCTCTATA CCAATTAGTT
CCAAAAGGGT ACAAAGGAGT TCAAAAAGAA CCTCTTACTG CTGGTCAATA TGCAATTAAT
ACCAGAGTAA AAACGGTCAA ATTAGTGCCA ACTACCCAAA TTATACTGAA TTGGTCAGAT
CAGAAAAAAC ATCCCTTAAA TTATGACTAT GAACTGAAAC AAATGAAGCT AATTTCTAGA
GACCATTTTG AAATTTTTGT TCAGTTTACC CAAATTATTC GTATTGCTGC TGAGAATGCT
CCTAAGATGA TTTGTATGGT TGGATATTAT ACAGGGGAAG ACAAAATATA TGTTACTGAT
GATTCTGGGC AAGTTGTTAA GAAATATGCA GTTATTAGAA ATTTAGTTTC TCGTGTATTA
ACAAAAGTTG TTTATAGTCA TTTTCAACAA GCTGCTACTG GTAAAACGGC AATACAGTTT
CTAAACACAA GAGGTGACTC TCAAAAGGAG GCCGAAAATT ATATAAAAAC ACTTCTTGAA
ATGATTGGTG TTGAAGGATG TGGCACTTTT ATGGTTGATA CAGTTAATCT ACCTCTAGTA
GTTGATTCTT ACTTACAAGA GAAACAAAAG CAAGAGGCAA GAGAAGCTCA AGCAAAAGCT
GCACCTGAAC GGCTTCTTGA ACCTGAATTT GTTGAGAATC CTGAACAACG GCTTCCTGAA
CCTGAATTTG TTGAGAATCC TGAAAACCGT TGTCCCATTA TCCTCCTACT GGATACATCT
TACTCAATGT CAGGAGAAGC TATTACTGAA TTAAATCAAG GAGTGAAAAT ATTTCAGGCA
AGTGTAAAGG AAGATGAACT GGCTTCCTTA AGAGTAGAAA TAGCTGTCAT TACTTTTAAC
AGTGAAATTG AAGTAGTTCA AGATTTTGTT ACTGTAGATA AATTTATTCC CAAAACATTA
GAAGCATCAG GAGTAACGCA CATGGGAAAA GCTATTGAAA AAGCCCTAGA ATTATTAGAA
AAGCGAAAAC AAGACTACAA AAATAGCGAT ATTCAATACT ATCGACCCTG GATCTTTCTA
ATTACTGATG GGCAACCTAC TGATACTTGG CAAGATGCAG CAAAAAAAAT AGAAGAAGCT
GAAACTAATA GAAAATTACT TTTTTTTGCT GTTGGGGTAA GAGATGCAGA TATGGAGACA
TTAAGTGAAA TTTCTGTATG CCCTCCTAAA AAACTCAACG GCTTAGATTT TCAATCTTTG
TTTAAATGGC TAAGTTTTTC ACTTCAGCAA GTTTCAGTTA GCAAGATAGG AGAAAAGAAT
AGACTTCCTC CAACGAATGC ATGGGAAGAA ATAACTAGTA AAAATCAAAA TACTAAACAG
ACTCAACAAA CTCAACAAAA GACAACAATT CCTAATTCTG ACCCTAATCC TGAACCAATT
ATCCCTATCT TAGTTTTAGA TAGAGATATT TTTAAAGATC AACGAATTAA GAGTAATTCG
GAGGGAGAAA TTTGGATAAC TCAAAAATAT CGCTACCGAA AGAAATACCT AATAAAAATT
TATTATGAAG TTACACCAGC AAGGATAAAA AAGTTAGAAG TAATGGTAGC TTATAAACCG
AAAAATTTTC ATGGTTCTCA ACAAGCGTGG GCTTGGCCTG AGTATTTACT AGCAGATAAA
ACAGGAAAAA TTATCGGCTT TGTCATGGAA TTTATTGAAG ATAGTAAACT GCTATTTAAT
ATTTATAATC CTCAGCGTCG TAAGCAAATA AATAGTCAAC TCCACTGGTC AGTAGACTGG
CTTTTTCTTC ACCATACTGC TAAAAATATT GCTACTATTA TTCAGTCTCT TCATAGTCAG
GATTATGTTA TTGGAGATAT GAAGCCACAA AATATTCTAG TTAACCGATA TGCTTCTGCT
TCAATAATTA ATACAGACTC ATTTCAAGTT CGCCATCCTC AGACAAAAGA AATCTATCAT
TGTTTAGTTG GTTCCGAAGA ATTTACCCCT CCTGAACTAT TAGAAAAAGA ATTAGCAAAA
ATTGTTCAAA CTCCTACCCA TGATAACTTT AGATTAGCCC TTATTATCTA TCATTTATTA
TTTGGAGGAC ATCCCTTTAA AGGAAGGTGG ATAGGAACAG AAGAGCCACC CAAAATTGAT
GAACTCATCC GACTAGGTTT CTGGTGCTAT GCTCCCAATA GTAAAATTCT ACCAGGACCG
AGAACTATTC CCCTTGAAAT AGTTCACCCC AAAATTCAAA AATGCTTCCA AAAATGCTTC
AACGATGGAC ACTATCATCC AGAAAAACGA CCCACTCCTC AGAACTGGGT TGACGCTTTA
GAGAGTGCCA TTAATGATTT AGTACAGTGT AAAAGAGTTG ATACCCATTG GTATAGTAAA
ACTTATGGCA AATGCTATTG GTGCGAAAGA GAAGAAAAAT TAGAGGTTGA CATATTTTCT
GATTCCAAAA CATAA
 
Protein sequence
MPSEFEKVLD EIRKRSSKYE DIGDVKREII NALSQRYKTQ ERLFTHYVLE KGGFDIRKLT 
LSDPLLVIIS DFFNQCNGLN KNELYKFFRV LYDENKGISL LKEIRDTLVQ FKHQSTDTPT
YTPSPYISVI TSEAIGLVEA KYGQSLEQGK NFAKKVDCNN FQDVQAFFNN GGQAGKQLAI
LEPGTYYINP EIFTIRTVPI IRILQGEIGL VIANEGTFKS DEQTLGRVVE CDNFQDAEAF
LKNGGQKGKQ LAILTDGDYK INTDFFSVIT TTNAYKYNEN PNNLKVYKID KDKIGILTTM
VGKILPKGEI AGPIIEGHDN YQNAQKFLDL GGYKGLQEEV LQEGAWSLNP WFVEVEQVPL
TRIEQEEVGV VISFVGKEYD KNYDQQPIFY GAEKSLYQLV PKGYKGVQKE PLTAGQYAIN
TRVKTVKLVP TTQIILNWSD QKKHPLNYDY ELKQMKLISR DHFEIFVQFT QIIRIAAENA
PKMICMVGYY TGEDKIYVTD DSGQVVKKYA VIRNLVSRVL TKVVYSHFQQ AATGKTAIQF
LNTRGDSQKE AENYIKTLLE MIGVEGCGTF MVDTVNLPLV VDSYLQEKQK QEAREAQAKA
APERLLEPEF VENPEQRLPE PEFVENPENR CPIILLLDTS YSMSGEAITE LNQGVKIFQA
SVKEDELASL RVEIAVITFN SEIEVVQDFV TVDKFIPKTL EASGVTHMGK AIEKALELLE
KRKQDYKNSD IQYYRPWIFL ITDGQPTDTW QDAAKKIEEA ETNRKLLFFA VGVRDADMET
LSEISVCPPK KLNGLDFQSL FKWLSFSLQQ VSVSKIGEKN RLPPTNAWEE ITSKNQNTKQ
TQQTQQKTTI PNSDPNPEPI IPILVLDRDI FKDQRIKSNS EGEIWITQKY RYRKKYLIKI
YYEVTPARIK KLEVMVAYKP KNFHGSQQAW AWPEYLLADK TGKIIGFVME FIEDSKLLFN
IYNPQRRKQI NSQLHWSVDW LFLHHTAKNI ATIIQSLHSQ DYVIGDMKPQ NILVNRYASA
SIINTDSFQV RHPQTKEIYH CLVGSEEFTP PELLEKELAK IVQTPTHDNF RLALIIYHLL
FGGHPFKGRW IGTEEPPKID ELIRLGFWCY APNSKILPGP RTIPLEIVHP KIQKCFQKCF
NDGHYHPEKR PTPQNWVDAL ESAINDLVQC KRVDTHWYSK TYGKCYWCER EEKLEVDIFS
DSKT