Gene Tery_0239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0239 
Symbol 
ID4242394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp373560 
End bp375467 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content39% 
IMG OID638105583 
ProductRNA-directed DNA polymerase 
Protein accessionYP_720199 
Protein GI113474138 
COG category[L] Replication, recombination and repair
[V] Defense mechanisms 
COG ID[COG1403] Restriction endonuclease
[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.114729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0192073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAG CTGAAATACT AAAAAGAAAA TTGGACAATC CAAGGCCCTT TTCAAGTGTG 
GCATGGGACA CGTACGATAT ACCTAACCAG GTTTGTGTTA ATCCCAATCT CAAATGGAAA
GACATCAACT GGAAAAAGGT AGAAAAGTAT GTGTTTAAGT TACAAAAGTT AATCTATAGA
GCATCCAGCC GTGGCGAAAT CCGCAAAATG CGTAAATACC AAAAACTTCT GACCAAAAGT
TATTATGCAA GGTTGCTAGC TGTTAGGCGT GTGACTCAGG ACAACCAAGG AAAGAAAACT
GCTGGTATAG ATGGTATAAA AAGCCTTCCC CCAATGCAGA GGTTGAACCT GGTAGAAATG
TTAGGGTCAC GATTTCTTAA AGCAAGCCCA ATCCGTAGAG TCTGGATACC AAAACCAGGT
AGAGAAGAAA AACGTCCACT AGGCATACCC ACTATGTATG ATAGGGCACT TCAAGCACTG
GTAAAGTTAG GCATGGAACC AGAATGGGAA GCACTTTTTG AACCTAATAG TTATGGTTTT
AGACCAGGAC GGTCAACATA CGATGCTATT GCAGCAATCT ATGTCAGTAT TAACCACAAA
CCAAAATATG TTTTAGATGC TGACATATCC AAATGTTTTG ACCGAATTAA CCATGATGCA
CTGTTGGGAA AAATAGGAAA ATCCCCATAT AGAAAATTAG TTAAACAATG GCTAAAATCC
GGGGTATTTG ACAATAAACA ATTCTCAAAC ACTGTGGAAG GTACACCACA GGGAGGGGTA
ATATCACCCT TGCTAGCAAA CATCGCCCTA CACGGTATGG AAAAATGCCT AGAAGATTAT
GCAGAAACCC TCCCAGGGAC AAAGCGTGAT AATCAAAGAG CATTATCCTT AATACGATAT
GCCGATGACT TTGTAATCCT ACATAAAGAC ATCAAAGTAT TGTTACAAGC AAAAACTGTA
ATACAGGAAT GGTTAAACCA AGTAGGGTTA GAACTAAAAC CAGAAAAAAC CAAAATTGCC
CACACTCTGG AAGAATATGA AGGAAATAAA CCCGGATTTG ACTTTCTAGG ATTTACAATA
AGGCAATGGA AAGGTAAGAC AACCAAACAA GGATTCAAAA CACTGATTAA GCCATCATCT
AAGAGTATTA AAACTCATTA TCGGAAGCTG GCGGATATAG GTGACACCTA CAAAACCGTC
CCTACAAAAG CTCTAATAGC TAAACTTAAT CCGGTAATTA GAGGATGGGC CAACTACTTT
TCCACCGTAG TCAGTAAAGA GGTATATAAT AAATTAGACT ACCTTCTATG GGAAAGATTA
GGTCGATGGG CAAGTAGACG GCATCCAAAC AAGTCAGCCA AATGGGTCAA GAATAAGTAT
TTTCCTCGCT GCAAAGTCAC CAGAAACTGG TTACTTAACG ACGGCGAATA TATACTTAAC
CAACACTCAG ACGCTGCCAT AAAAAGGCAC GTCAAGGTAA AAGGCAATAA ATCCCCATTA
GACGGTGATT TGACTTATTG GAGTAGTAGA ATCGGCAAAC ACCCAGGTGT AAGGAAAGAA
GTCACAACGC TGTTAAAACG GCAAAAGAAT AAATGCGCAT TTTGTGGACT AACCTTTAGA
TCAAATGACC TCATGGAAAT AGACCATATA AAACCAAAGT CTGAAGGCGG TGATAACTCA
ATTAAAAACA AGCAACTGTT ACACCGACAT TGCCACGATA CTAAAACTGC TTTAGATAAT
AAAACATACA CAAAACCTAA GTTACAGGAT TTACCTGATG AATACCTATG GGTAAATGAT
ATGTTAATTC TAAAACAGGG ATGTACCTAT GAAAAAGGAC GTTTAGGAGA GAAGCCGGAT
GAGGTGAAAG TCTCACGTCC GGTTTTGAAG ACGAGTCGGG TAAGGTAA
 
Protein sequence
MNKAEILKRK LDNPRPFSSV AWDTYDIPNQ VCVNPNLKWK DINWKKVEKY VFKLQKLIYR 
ASSRGEIRKM RKYQKLLTKS YYARLLAVRR VTQDNQGKKT AGIDGIKSLP PMQRLNLVEM
LGSRFLKASP IRRVWIPKPG REEKRPLGIP TMYDRALQAL VKLGMEPEWE ALFEPNSYGF
RPGRSTYDAI AAIYVSINHK PKYVLDADIS KCFDRINHDA LLGKIGKSPY RKLVKQWLKS
GVFDNKQFSN TVEGTPQGGV ISPLLANIAL HGMEKCLEDY AETLPGTKRD NQRALSLIRY
ADDFVILHKD IKVLLQAKTV IQEWLNQVGL ELKPEKTKIA HTLEEYEGNK PGFDFLGFTI
RQWKGKTTKQ GFKTLIKPSS KSIKTHYRKL ADIGDTYKTV PTKALIAKLN PVIRGWANYF
STVVSKEVYN KLDYLLWERL GRWASRRHPN KSAKWVKNKY FPRCKVTRNW LLNDGEYILN
QHSDAAIKRH VKVKGNKSPL DGDLTYWSSR IGKHPGVRKE VTTLLKRQKN KCAFCGLTFR
SNDLMEIDHI KPKSEGGDNS IKNKQLLHRH CHDTKTALDN KTYTKPKLQD LPDEYLWVND
MLILKQGCTY EKGRLGEKPD EVKVSRPVLK TSRVR