Gene Tmel_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmel_1446 
Symbol 
ID5297782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermosipho melanesiensis BI429 
KingdomBacteria 
Replicon accessionNC_009616 
Strand
Start bp1439689 
End bp1442958 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content46% 
IMG OID640769725 
Productfibronectin, type III domain-containing protein 
Protein accessionYP_001306678 
Protein GI150021324 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCCCAA AACTTAATGT GGATTCCAGC TACATTCCGT ATATACGCCA CCCAACACGC 
TACAAAAGTG TTGCAGTTTT CGCCAAAATC GACGGCACAA ACTGGTACGA CATCACCGAG
TGGGTCAAGG AAGTCCGCAT CGTCAACAAA CTCGAATTTC TCGAAAGCCC CGCTATTGAC
TCGGCCACGA TAATTCTCGC TAATCTCAGC AACGAATGGA CACCGACGCA GTATAACGAC
GCATTCGATC CTTCAAGCGG CAAATTCAAC GGTACCGTAG ATCAGGCTTA CCTCAGCAAG
GAATGGGAAG TCAAAATCCT GCTCCGTGTG TACAAGAGCG ATACCGAATA TATCGATGTC
CCGCTCTTCT ACGGTGTCAA AACCCAGCTC ACAGAACGGC ACAAAGAAGC AGAAATGAAA
GTGGCTGATA TATGCTACTA CGCAACGAAG AAAAAACTCG ATACAGACAT ACTTTACATC
GATATGAAAC CGCACGAGAT ACTCGCTGAT CTCTTTCAGC GTGCAGGCCT CGACTCGTCG
CAGTTTGACT TTCAGGAAGT CCAATATCCC TGCACATTTC TCGCAAGGAA AGACAGCACA
ATTTGGCAAA CCGTGATAAA CCTCGTTCGC GGCACAGCAG GGAAGATATC CACAACACCC
GAAGGCAAAA TCATATACAG AACAAGAATG GACACAAGTT CGTATTCCGA TCCCAACCCC
GCACTCTCGC TTCAGCAGGA CACTTTCAAA CGCTACGATC TTGGAACCGA AAGACGGTTC
AACAAGATCA CGCTTGAAAG CGAAGGATAC CGCGTCGATG ACGACCTGTC GTGGGTGATT
GACTTTGAGC TTCAGGGCAA CAACACAATA GCCCCTGGAA CTACTGCGAC CTTTGAACTT
GAGTATGTAT CAGACTACGC GGTGGCTGTG AGCGATACGA TATACATAAG CTATTCTGTC
GGTGCGGGCT TTCAGGAGGA CGTGCCGTTC ACGGTCTTGG AGAGCACCCC ACAGCCTTCA
AATGAGCATA TCCAGATCAC CAAGTACGAA AGATACGCCG ATAAAGCCGT TATCGAGATA
AAAGCCCTTA CGACATCAGA AAACGTCGTA ATAAACCATG TGAAAATTCA GGGGCGGCAG
GTCAAGAAGG TCAGTGTAAA CAAGCTGATC AAAGAAAACA CCACAGGCGA GCCCGACAAA
GAATACAGCG TCAGAACCTT CTTTGACTCG CAAGCTGTGA TGAGCAATAT AGCCGATGTG
CTGTACGAAA ACATCAACAA AACGATCCGC TTCGGTCTTG CGATGAACGA GTTCTACGCG
GATGTGTATG CGGGAAATCT TATCGAATTC GCAGTACCGC TTAAAGGGAT TAGTTCAGGA
ACATTCCTTG TTCTGAAGGT AGAGCACAGC CTGCAGTCTG CTAAGTTTCA GACATCGCTC
GATATCGTCG AGTGGAAGGA TATCGAATTC ACGACAGGCG ATAAGACCTT CACCCGCGCT
ACTCGCTCTC CTGACCCCGT CGAAAACCAA ACGCAGCAGC AAATCACTGA GGTTCAGGGA
CAGATTCAAG AGCTTCAGGA GCAGGTGAAT GAGGTTGATG AGAGAACGAA CTATATTGAC
AGTGCAGCAC CTTCTGTGCC GCAGAATTTG TCGCTTTCCA CAACTATGAA TGATAAAGGC
GAAAGCGTAG TTAGAGTAAG CTTTGACCCT GTACCGGAAG CTGATGTAAT AGGGTATGAA
GTAACTTGGA GTCTTGATGG TGTTCACTGG CACTATTATA CAACAGCAGA AACGCTGTCG
CAGTTTGTGG TGCCCGGAAA CACCACCGTG TATGTAAAGG TGCGGGCTCT GGATGCGGAA
GGAAAGAAGT CAGATTGGTC GAGCGCAGCT TCCATAACGA GCGCAAAAGA CGAGGTGCCG
CCTGCTGTTC CAACAGGTTT AACACCGACG GGACTCTTTC AGACGATCAT GGTTAAGTGG
AATCCAAACA CTGAAGATGA CTTCGACCAC TATGTCTTGC AGTACGACAC TAAGAGTGAT
TTTTCAACAG CAAAAGAGAT AGTTTTGAAC GCTACATCGG CTGTTATAAA AGATTTGGCG
GTTAATACGA CTTATTACCT GAGAATCAAA GCTGTTGATA AGTCTGGAAA TGCGAGCGAT
TGGAGTTCTG CCGTGACGGC CTCAACGGTG AAGTTGGACG ATGCGAGTTA CTACGACTAT
GCTGCAATTA AGGACGCTAT TATTCAAAAC GGAAAAATCG ATACAGCATG GATCAGCGAG
CTTGATGCTG GTGTGATAAC GACCGGTTAC CTCGACGCTG ACCGCATCCA GGCCCGCAGC
ATCACCCTCG ATAAACTCGC GGTGTCACCT GCGTTCTCAT TGCCCTCAGG CACCCTCGCC
TACTGGACTA ACTCTCTCAT CGACGAAGCA AATCAGATAA TGCCTGAGGG ATATACGGAG
GTTAATCTCG CCCCAAGGGT TACGCTTATT CCAGAAAATG CGCCTGAAGG TAGTGTTGTG
GGTGATTTGA TTGCAGCAAA TACAATATAT GCAGGCAAAC GCATCGATGT GGGGACAGGA
GTTAACCGAT GGGCGATTGA CGGTACTAAC GGGCTTGTTA GAGTTTTGAA TAATTCAGAT
TACCCAGTTT TAGGAATTAT TTATGGGCCA TATTCAATAA ATATGAATGG AATTACAGAA
CAAACAATAA CATTACCAGT TGCATTGACT GAGTATTTTG TCTTGCTTGG TATTAGTAAT
TTTGAGTATT GGCGTTCTAC TTTCGCCCCA AATTATTCAA GAAAATTAAT ATTAGATTGG
CAAAAAATTG ATAACCAAAG TTTTAAAATA TTAGCATATA CACAACTCGT GCAGCCAAAA
CAATATCCAA ACGCTCAAGT CACAGTTGAA TACAATGACT CCACAGAAAC ATATTATACT
TTGTATACAA TTACGGTTGC TGTAGATGAG GCTGTTGTGC GGATAAATGG CCCCGATCTT
TCTGTTGAAT ATTTGCACCG TTACTCTGGA ACACCACCTT ATACGATTAA AGTGTCGTAT
ACATGGGAGC TGAAAAAAAT TCTTCCAGAC GGCATTTATT TTGCCAAACT CGGTCATATT
GATTGGGGTG ACGGAACAAT AACAGAGTGG GCATTGCGAG ATAGCCCTTC ATCATCTTCG
AATGTTACAG CGACATTGGT AAGTTACACG CCGAGCACAC TAAAAGAATC AGCGAATGCC
ATGATTACCT ATTCTGTAAT AGGATATTAA
 
Protein sequence
MFPKLNVDSS YIPYIRHPTR YKSVAVFAKI DGTNWYDITE WVKEVRIVNK LEFLESPAID 
SATIILANLS NEWTPTQYND AFDPSSGKFN GTVDQAYLSK EWEVKILLRV YKSDTEYIDV
PLFYGVKTQL TERHKEAEMK VADICYYATK KKLDTDILYI DMKPHEILAD LFQRAGLDSS
QFDFQEVQYP CTFLARKDST IWQTVINLVR GTAGKISTTP EGKIIYRTRM DTSSYSDPNP
ALSLQQDTFK RYDLGTERRF NKITLESEGY RVDDDLSWVI DFELQGNNTI APGTTATFEL
EYVSDYAVAV SDTIYISYSV GAGFQEDVPF TVLESTPQPS NEHIQITKYE RYADKAVIEI
KALTTSENVV INHVKIQGRQ VKKVSVNKLI KENTTGEPDK EYSVRTFFDS QAVMSNIADV
LYENINKTIR FGLAMNEFYA DVYAGNLIEF AVPLKGISSG TFLVLKVEHS LQSAKFQTSL
DIVEWKDIEF TTGDKTFTRA TRSPDPVENQ TQQQITEVQG QIQELQEQVN EVDERTNYID
SAAPSVPQNL SLSTTMNDKG ESVVRVSFDP VPEADVIGYE VTWSLDGVHW HYYTTAETLS
QFVVPGNTTV YVKVRALDAE GKKSDWSSAA SITSAKDEVP PAVPTGLTPT GLFQTIMVKW
NPNTEDDFDH YVLQYDTKSD FSTAKEIVLN ATSAVIKDLA VNTTYYLRIK AVDKSGNASD
WSSAVTASTV KLDDASYYDY AAIKDAIIQN GKIDTAWISE LDAGVITTGY LDADRIQARS
ITLDKLAVSP AFSLPSGTLA YWTNSLIDEA NQIMPEGYTE VNLAPRVTLI PENAPEGSVV
GDLIAANTIY AGKRIDVGTG VNRWAIDGTN GLVRVLNNSD YPVLGIIYGP YSINMNGITE
QTITLPVALT EYFVLLGISN FEYWRSTFAP NYSRKLILDW QKIDNQSFKI LAYTQLVQPK
QYPNAQVTVE YNDSTETYYT LYTITVAVDE AVVRINGPDL SVEYLHRYSG TPPYTIKVSY
TWELKKILPD GIYFAKLGHI DWGDGTITEW ALRDSPSSSS NVTATLVSYT PSTLKESANA
MITYSVIGY