Gene Tery_4768 details


Gene Information

Locus tag: Tery_4768
Symbol:
ID: 4246422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7323658
End bp: 7325688
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 31%
IMG OID: 638109619
Product: sulfotransferase
Protein accession: YP_724195
Protein GI: 113478134
COG category: [N] Cell motility; [R] General function prediction only; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF; [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
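
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7323658
END_BP = 7325688
GENE_LENGTH_BP = 2031
PROTEIN_LENGTH_AA = 676


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def encoded_residues(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(encoded_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 7325688 - 7323658 + 1 gives 2031 bp, and 2031 / 3 - 1 gives 676 aa, which agrees with the listed Gene Length and Protein Length.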


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.260339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.153237
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATCAGG AAACATCGGC TATCAATTTT AACCAAACAG CAGAGTTTTA TTTATCCCAG 
GGAAAATTAG AAGCAGCTTA TGAAATTTGT CAAAACATCT TAGGGGATTT ACCTAACTTT
GCTCCCGCCT ACAATACTCA AGGAAAAGTA TTACAAGCAA TGGGAAAAAT AGAATCAGCA
ATTATCAGTT ATCGCCAAGC AATAAAATTA AATCCTCAAC AAATTGAAAC TTATAAAATA
TTAGGAGATA TCTTAGTAAA ACAAGAACAA TTATCTGAAG CGATCGCCTG TTATGAAACA
GGTATTAAAT ATAACCCAAA AGCATCATTA TTTTATCATA AATTAGGCTT AGTATTAATA
CAACTCAAAA GCTGGGATGA AGCAGTCAGC GCCTTCTGTC GTGCAATTCA ATTTAATCCA
AATTTTCCCT GGTCTTACTA TAAATTAGGA GAAGCTTTAA CTCAACAAAA AAAATGGCAT
CAAGCGGTTA TTGCTTATCA ACGTTCTATT GAGATTAAAC CAGATTTATG TTGGTCTTAT
CAACATCTAG GTAATGCTCT AATTAAACAG GGAAAAATAG ATCAAGCGAT CGCCTATTAT
CAAGAAATAC TTCAACAGCA ACCACACCTT GATAGAATTC ATAAATTATT AGCAGATGCT
TTAGTCGAAA AAGGTGAAAT TGATGGAGCA ATTCCTAATT ATCTCAAAGC TATTCAGTTA
AATCCAGATT TTCCTTGGTC TCATGTATGT TTATGGGAAA TATTTCTCAA AAAAGATCAA
TGGAATGAAG CAGTTATTAT TTATCGCCAA GCAATTAAAC TTAACCCTAA TGCTTTTTGG
TTATGGACTT ACTTAGGAAA TGCTTTGGTT AAACAAGGAG ATTTAGAAAC AGCAATTACT
TGTTATCAAA AAGCTATTTC TATTCAACCC AATATTTCGA AAATTTATCA ATTTTTAGGA
GATGCTTTTG TTCAACAACA AAAGTGGGAT GAAGCTGCTT TTGCTTATCT GCGTGCTATA
GAAATTAACC CAGAATTATC CTGGTCAAAT TATCATTTAT GGAATACCTT AGACCGCTGC
CATAAATTAG ATGCAGTAGT CAATTTATAT CGGCAGTTTA TTAAAAAAAA TCCAGATTCA
TTTTTATCTT ATTTACGCCT TGGAAAAATC CTAACAAAAC AGAATCAAAT AAATGAAGCA
ATTATCTGTT ATCAAACTGC TTGTTACCAA CAAACTATAA AATCTTATCC CTATCTTTTC
AAAAAAAAGT GGGATTTTAC TGAAGTTAAA AATCCTAATT TTCTGATTAT TGGAGTAGGA
AAAGGAGGAA CTACATCTTT ATTTAGTTAT CTGATACAAC ATCCTCAAGT TTTGCCTCCT
GTGGTGAAAG AAGTAGATTT TTGGTCAATT AATTTTAAGA ACGGTATAAA TTGGTATCTG
TCTCATTTTC CTGCTCTTCC TAGCAATCAA AATTTCATCA CTGGAGAAGG AAGTCCTAGT
TATTTAGGAA ATTTGGAAGC TCCAGGTAGA ATATTTAGTT ATTTCTCCAA AATCAAGTTA
ATTATTATTT TGAGAAATCC TGTAGATAGA GCTATTTCTC ACTACCATTA TTGGTTGAGA
ATAAATAGAG AAAATCGTTT GTTAGAAACT GCATTAAATC AGGAATTAGA AAGTTGGAAA
ATGATTTATA AAAATTCTCC TTTAGATAGC AGTTATTGGC ATCATGGATT ATATTATTTA
GGCACTGGCA TATATATAGA TTTTATCCAC AATTGGATGA GCATATTTCC GAAAGAGCAA
TTTCTGATTT TATCCACAGA AGAATTTTAT CGAAATCCAA AAACTATCAT GAAAGAAGTC
TTTGATTTTC TCGGTTTGCC AAACTATAAT GTTCCCGAAT ACAATAAATT AAATTTAGGA
TATTATCCCT CTACAAGTAA ATCAATGCAA CAAAAATTGA GCAATTTCTT TCGACCTCAT
AATCAAAAGT TGGAGGAGTA TTTAGGTATG AAATTTAACT GGGAAAGTTG A
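
The GC content field of the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the space-grouped layout shown above. `gc_percent` is a hypothetical helper, not a function of the source database, and the method the database uses to round its reported value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First row of the gene sequence above, used only as sample input.
first_row = "GTGAATCAGG AAACATCGGC TATCAATTTT AACCAAACAG CAGAGTTTTA TTTATCCCAG"
print(round(gc_percent(first_row), 1))
```

To compare against the reported figure, pass the full 2031 bp sequence rather than a single row, since a 60-base window need not match the gene-wide value.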
 
Protein sequence
MNQETSAINF NQTAEFYLSQ GKLEAAYEIC QNILGDLPNF APAYNTQGKV LQAMGKIESA 
IISYRQAIKL NPQQIETYKI LGDILVKQEQ LSEAIACYET GIKYNPKASL FYHKLGLVLI
QLKSWDEAVS AFCRAIQFNP NFPWSYYKLG EALTQQKKWH QAVIAYQRSI EIKPDLCWSY
QHLGNALIKQ GKIDQAIAYY QEILQQQPHL DRIHKLLADA LVEKGEIDGA IPNYLKAIQL
NPDFPWSHVC LWEIFLKKDQ WNEAVIIYRQ AIKLNPNAFW LWTYLGNALV KQGDLETAIT
CYQKAISIQP NISKIYQFLG DAFVQQQKWD EAAFAYLRAI EINPELSWSN YHLWNTLDRC
HKLDAVVNLY RQFIKKNPDS FLSYLRLGKI LTKQNQINEA IICYQTACYQ QTIKSYPYLF
KKKWDFTEVK NPNFLIIGVG KGGTTSLFSY LIQHPQVLPP VVKEVDFWSI NFKNGINWYL
SHFPALPSNQ NFITGEGSPS YLGNLEAPGR IFSYFSKIKL IIILRNPVDR AISHYHYWLR
INRENRLLET ALNQELESWK MIYKNSPLDS SYWHHGLYYL GTGIYIDFIH NWMSIFPKEQ
FLILSTEEFY RNPKTIMKEV FDFLGLPNYN VPEYNKLNLG YYPSTSKSMQ QKLSNFFRPH
NQKLEEYLGM KFNWES
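
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own procedure. It builds the codon table from the standard amino acid assignments (which translation table 11 shares with the standard code) and assumes the input is a complete CDS, so the first codon is read as methionine even when it is an alternative start such as GTG. `translate_cds` is a hypothetical name.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop.

    Assumption: the sequence starts at a valid initiation codon, so the
    first codon is always reported as 'M'.
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    residues = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAATCAGG AAACATCGGC TATCAATTTT"))  # MNQETSAINF
```

The ten residues produced from the first 30 bases match the start of the protein sequence above (MNQETSAINF). An established library such as Biopython would be the safer choice for real analyses.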