Gene Tery_1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1518 
Symbol 
ID4241705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2306692 
End bp2308485 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content31% 
IMG OID638106665 
ProductTPR repeat-containing protein 
Protein accessionYP_721275 
Protein GI113475214 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID[TIGR02466] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00971618 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACAGAG AACAAAAAAG AAAAGCAGTA AAAAATAAAA GTCAAAAAAT ACTCAATAAT 
AATATCCCAG AAAATGCCTT ATATCGTGCT TTATCTTTTC ATAAAAACGG AGAATTGCAA
AAAGCAAAAG TTATCTATGA AAAAATTATT CAGTTGGAAC CAAATAATTC TCAAGTCTTA
AATTATCTGG GGGTACTTAA AGCACAAATG GGAGATAATA AGAGTGCTAT AAGTTTAATA
AAGAAAGCTG TTAATCTAGA ACCATTAAAT TTTAAATACC TCAATAATCT CGGCAATACT
TATCGCGCCT TTGAACAACT TGATAATGCT ATTGACTGTT ATAAACGTGC GATTCAGGCA
GATAAAAATT CTGCTGAATA TCACCTAAAC TTAGGTATTG CTTTGACCGA AAAAGGAATC
ATAGAAGAAG CGATCGCTTC CCTAGAAAAA GCTTTAACTA TTAATCCAAA TTATCAGCAA
GTTAACATGG CTCTAGGAGA TATATTTCAG ACTCAAGGGA AATTAGATAA AGCAATATCT
TCTTACATAA AAGCTCTCTC AATAGATCCA AAATATTCAA AGAATCAAAA TCCCAATAAT
TTTGATGCTC TATTATCTTT AGGCATGGCA CTTTATCGCC GTGGAAACCT CAAAGAATCA
CAAATAACTT ATGAACAAGC ATTAGAAATT AATCCTCATT CCACAGAATG CTTGACTAAT
ATTGCTGCTA CTTTTTATGA GCAGGGAAGA GTTGATATAG CTGAAGCTTG TTATCAAGCT
GTTGTTGATT TAATTCCTAC TTCTACTGAT GCTCATATAA ATTTGGGTTT TCTGTTAAGC
CAACAGGAAA AATATGACGA AGCAATTGAG TGTTATAAAG CAGCATTGAA GCAGGATCAA
AATTCTGTTA ATGCTATAGC AGGTCTAGCA GAAGTTTTCG GAAAAAAATC TGACTGGAAA
ACAGTCTTTC AACTATATCA AAAAATTCTG AAACTTGATT CTAACTCTGC TGATGCTTAT
GCAAAACTAG GAATATCTTT GCGAGAAATA GGTAAATCTA AAGAAGCAAT TCCTCAATTT
GAAAAAGCTA TAAGTATTAA TAATAGACAT ATTAAAGCTT ATGCTAATTT GGGTTTAGCT
TTGCAAGATG TAGGCAAACA GTCTGAAGCT AAATTTATTT TTGACTATTC AGAATTAGTA
GCTAAATACC AGTTTACTAG TATAGAAGGA TGGCAAAATT TGACAGCTTA TAATCATGAC
TTGAAAGATT ATATTGTTCG ACATCCTACT CTTTTAAAAA ATCGACCAGG CAAACCCATA
AATAAAGGTA GTCAAACTTA TGAAATTTTT ACAGATAATA CTCCTGTAAT AACAGCACTA
AGAAAAAAAA TTAATAGTTA TCTAGAAGAT TATTTTAGTC GTTTCACTTC TAATTCTAAC
TATCAATTTT TTCATAATAT ACCCTTAGAT TGGAAACTCA GTGGTTGGGC AGTTGTTTTA
GAATCAGAAG GATTTCAATC TTCTCATATT CATCCTGAAA GTTATTGCAG TGGCGTTTAT
TATATTCAAG TTCCTAATAC TATTAAAGAA AATAATTCTG AAGCAGGTCA CTTAAATTTT
GAAACTTGCT TTTCTTCAAA GTTAGATATG AAAAATTCAG ATAAATATCA GGTTAAACCA
CAAGCAGGAT TATTAGTAAT TTTTCCTTCT TATTTTTGGC ACTCTACTAT TCCATTTATT
GGAGATAGTG AGAGAATTTG TATTTCGTTT AATATGGTTC CTGTGAGTAG CTAA
 
Protein sequence
MNREQKRKAV KNKSQKILNN NIPENALYRA LSFHKNGELQ KAKVIYEKII QLEPNNSQVL 
NYLGVLKAQM GDNKSAISLI KKAVNLEPLN FKYLNNLGNT YRAFEQLDNA IDCYKRAIQA
DKNSAEYHLN LGIALTEKGI IEEAIASLEK ALTINPNYQQ VNMALGDIFQ TQGKLDKAIS
SYIKALSIDP KYSKNQNPNN FDALLSLGMA LYRRGNLKES QITYEQALEI NPHSTECLTN
IAATFYEQGR VDIAEACYQA VVDLIPTSTD AHINLGFLLS QQEKYDEAIE CYKAALKQDQ
NSVNAIAGLA EVFGKKSDWK TVFQLYQKIL KLDSNSADAY AKLGISLREI GKSKEAIPQF
EKAISINNRH IKAYANLGLA LQDVGKQSEA KFIFDYSELV AKYQFTSIEG WQNLTAYNHD
LKDYIVRHPT LLKNRPGKPI NKGSQTYEIF TDNTPVITAL RKKINSYLED YFSRFTSNSN
YQFFHNIPLD WKLSGWAVVL ESEGFQSSHI HPESYCSGVY YIQVPNTIKE NNSEAGHLNF
ETCFSSKLDM KNSDKYQVKP QAGLLVIFPS YFWHSTIPFI GDSERICISF NMVPVSS