Gene Tery_4800 details

Gene Information

Locus tag: Tery_4800
Symbol:
ID: 4246454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7378708
End bp: 7380708
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 38%
IMG OID: 638109648
Product: group II intron, maturase-specific
Protein accession: YP_724224
Protein GI: 113478163
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
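
The coordinates and lengths listed above agree with one another, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(7378708, 7380708)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2001 666
```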


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.256541
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAG CTAAAACACT AAAAAGAAAC TTAGGAAACC CTCAACCATT TTTAAGTGTG 
GCATGGGATA CAGAGGATAT ACCGTCTCAA GTTAGTATCA ATCTAACTCT TAAATGGAGA
GACATAGACT GGAGGCGGGC TATGAAGAAT GTATTTAAGT TGCAAAAGTT AATTTACAGA
GCATCCAGCT GTGGCAAAAT TCGCAAGATG CCTAGATACC AAAAACTTCT GACCAAAAGT
TATTACCCTT TGTTGCTAAC TGTTAGATGG GTGAGACAGG ATAATCAAGT TAAAAACACC
GTCGGGGTAG ATGGTATTAA AAATCTACCA CCAATGCAAC GTTTTAATTT TGTGTACCTA
CTCAAATCAC ATTCCCTGAA AGCCATTCTA AACCCTAGAG TCTCAATACT AAAGCAAGGG
TTAAATGCAA AATTTTCCTT AAGCATCCCA ACAATGTACG ACCGAGCATT ACAGGGCTCG
GTCAAGCTAA GTATGGAACC AGAGTGGGAG AAACGCTTTG AACCTAATAG TTATGGTTTG
GGACTAGGAC GATCAACCAA TGATGCTATT GAAATCATCT TTACCAGCAT CAAGAATAAA
TCTAAATACT TTCTTGATGT TGATATATCC AAATGTTTTG ATCTATTTGA TCTAATTAAC
CATGATGCAC TGCTAGGAAA AATTAAGAAA TCATCCTACA GACGACTAAT CAAACAATGT
TTAAAGTCTA GAGTTTTAGA CAATAATCAA TTCTACCCAC CCCAACAGCT CCTATTCACC
TACCCCGACA GCTTCCTTCT CGGAAATAGA TGTGGGTTGG TTGAGTTGGA TACAGCACAG
GGAGAGGTAA TGAGTCCTCT GCTAGCAAAC ATCACCTTAC ACGGTATAGA GAAAAGACTA
ATGGAGTTTG CCAAAACCCT AGATTTGAAA AATAAAAAGG GTGCTCAAAT GAGCTGGCAA
TCGAAATGTC AAAGTCTAAT TCTAGTAGAC TATGTAGATG AGTTCGTAAT TCTACAGGAA
GATATCAAAG TACTACTGCA AGCAAAAACC GTAATGCAGG AATGGTTAAA CCAGGTAAGA
TTAGAATTAA AACAAGAATT GACCAGAATT GCCAACGCTC TAGAAGAGTA TAAAGGGAAC
AAGCCTGGAT TGGACTTCTT CGGATTCACA ATGAGGCAAG GTCAGGCAAA GATAGCAAAA
CTTGGATTTC AAACCCTGAT TAAATTGTCC GCTAGAAGTA TTAAAACTCA TTACCGGAAA
CTGGCAGAAA GATTTGATTC ATACAAAATT GCACCAGTCA AAGGATTAAT TGCTAAACTT
AATCCAGCTA TCAGTGGATG GGTTAACTAT TTCTCAACAC AAGTAAGTAC TAATAAAATA
TTCAACAAAC TGGATATGCT TCTGTGGAAG AGACTATGGT GTTGCGCAAG TAGACAGCAT
CCAAACAATT CAGCCACCTG GGTTAAACAA GAGTATTTCC CGAATATTGA GAATGGAAAT
TGGATTCTCC CGCGGGGCGA ATATATGCTA AATCAATACT CTGATGTTCC CATCATAGGA
GACATCAAAG TAAAAGATAA TAACTTAACA CTAGATGGTG ACTGGAATTA TTGGACTAGC
AGAGTTGGCA AATATTCAGG GGTAAAAACA GAGACTTCAA AATTATTCAA AAGTCAGAAG
AATAAATGTG CATTTTGTGG ATTGACCTTT AGAGTAACTG ACCTAATAGA GGTAAACTAT
GTAATACCTA AGTTTAAAGG TGGTGACAAC ATACTAAGGA ATAAACAATT GTTGCACCAA
TATTGCCACG AGACTAACAT TGCTTTAAAT CACAAGAGCT ATCTAATAGG CAATTCACAG
GACTTACCTG AATGTTACTT ATGGGTTAAC GATATGCTGA CACTAAAGCA GGGATGTACC
CTTGAATTGG GACCTCTAAC AGAGGAGCCG GATGAAGCGA AAGTTTCATG TCCGGTTCTG
AAGACAAGTC GGGTAGGGTG A
 
Protein sequence
MNKAKTLKRN LGNPQPFLSV AWDTEDIPSQ VSINLTLKWR DIDWRRAMKN VFKLQKLIYR 
ASSCGKIRKM PRYQKLLTKS YYPLLLTVRW VRQDNQVKNT VGVDGIKNLP PMQRFNFVYL
LKSHSLKAIL NPRVSILKQG LNAKFSLSIP TMYDRALQGS VKLSMEPEWE KRFEPNSYGL
GLGRSTNDAI EIIFTSIKNK SKYFLDVDIS KCFDLFDLIN HDALLGKIKK SSYRRLIKQC
LKSRVLDNNQ FYPPQQLLFT YPDSFLLGNR CGLVELDTAQ GEVMSPLLAN ITLHGIEKRL
MEFAKTLDLK NKKGAQMSWQ SKCQSLILVD YVDEFVILQE DIKVLLQAKT VMQEWLNQVR
LELKQELTRI ANALEEYKGN KPGLDFFGFT MRQGQAKIAK LGFQTLIKLS ARSIKTHYRK
LAERFDSYKI APVKGLIAKL NPAISGWVNY FSTQVSTNKI FNKLDMLLWK RLWCCASRQH
PNNSATWVKQ EYFPNIENGN WILPRGEYML NQYSDVPIIG DIKVKDNNLT LDGDWNYWTS
RVGKYSGVKT ETSKLFKSQK NKCAFCGLTF RVTDLIEVNY VIPKFKGGDN ILRNKQLLHQ
YCHETNIALN HKSYLIGNSQ DLPECYLWVN DMLTLKQGCT LELGPLTEEP DEAKVSCPVL
KTSRVG
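
Both sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its codons translated. It assumes the codon assignments of translation table 11, which match the standard genetic code; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments of translation
# table 11 are assumed to match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGAATAAAG CTAAAACACT AAAAAGAAAC TTAGGAAACC CTCAACCATT TTTAAGTGTG"
last_line = "AAGACAAGTC GGGTAGGGTG A"

print(translate(first_line))  # MNKAKTLKRNLGNPQPFLSV
print(translate(last_line))   # KTSRVG
print(gc_fraction(first_line))
```

The two translated fragments match the start and end of the protein sequence listed above. Applying `gc_fraction` to the complete gene sequence would give the value to compare against the GC content reported in the record.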