Gene Tery_3305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3305 
Symbol 
ID4243611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5068307 
End bp5070214 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content39% 
IMG OID638108293 
ProductRNA-directed DNA polymerase 
Protein accessionYP_722884 
Protein GI113476823 
COG category[L] Replication, recombination and repair
[V] Defense mechanisms 
COG ID[COG1403] Restriction endonuclease
[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00301539 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATAAAG CTGAAATACT AAAAAGAAAA TTGGACAATC CAAGGCCCTT TTCAAGTGTG 
GCATGGGACA CGTACGATAT ACCTAACCAG GTTTGTGTTA ATCCCAATCT CAAATGGAAA
GACATCAACT GGAAAAAGGT AGAAAAGTAT GTGTTTAAGT TACAAAAGTT AATCTATAGA
GCATCCAGCC GTGGCGAAAT CCGCAAAATG CGTAAATACC AAAAACTTCT GACCAAAAGT
TATTATGCAA GGTTGCTAGC TGTTAGGCGT GTGACTCAGG ACAACCAGGG AAAGAAAACT
GCTGGTATAG ATGGTATAAA AAGCCTTCCC CCAATGCAGA GGTTGAACCT GGTAGAAATG
TTAGGGTCAC GATTTCTTAA AGCAAGCCCA ACCCGTAGAG TCTGGATACC AAAACCAGGT
AGAGAAGAAA AACGTCCACT AGGCATACCC ACTATGTATG ATAGGGCACT TCAAGCACTG
GTAAAGTTAG GCATGGAACC AGAATGGGAA GCACTTTTTG AACCTAATAG TTATGGTTTT
AGACCAGGAC GGTCAACATA CGATGCTATT GCAGCAATCT ATGTCAGTAT TAACCACAAA
CCAAAATATG TTTTAGATGC TGACATATCC AAATGTTTTG ACCGAATTAA CCATGATGCA
CTGTTGGGAA AAATAGGAAA ATCCCCATAT AGAAAATTAG TTAAACAATG GCTAAAATCC
GGGGTATTTG ACAATAAACA ATTCTCAAAC ACTGTGGAAG GTACACCACA GGGAGGGGTA
ATATCACCCT TGCTAGCAAA CATCGCCCTA CACGGTATGG AAAAATGCCT AGAAGATTAT
GCAGAAACCC TCCCAGGGAC AAAGCGTGAT AATCAAAGAG CATTATCCTT AATACGATAT
GCCGATGACT TTGTAATCCT ACATAAAGAC ATCAAAGTAT TGTTACAAGC AAAAACTGTA
ATACAGGAAT GGTTAAACCA AGTAGGGTTA GAACTAAAAC CAGAAAAAAC CAAAATTGCC
CACACTCTGG AAGAATATGA AGGAAATAAA CCCGGATTTG ACTTTCTAGG ATTTACAATA
AGGCAATGGA AAGGTAAGAC AACCAAACAA GGATTCAAAA CACTGATTAA GCCATCATCT
AAGAGTATTA AAACTCATTA TCGGAAGCTG GCGGATATAG GTGACACCTA CAAAACCGTC
CCTACAAAAG CTCTAATAGC TAAACTTAAT CCGGTAATTA GAGGATGGGC CAACTACTTT
TCCACCGTAG TCAGTAAAGA GGTATATAAT AAATTAGACT ACCTTCTATG GGAAAGATTA
TGGAGATGGG CAAGTAGACG GCATCCAAAC AAGTCAGCCA AATGGGTCAA GAATAAGTAT
TTTCCTCGCT GCAAAGTCAC CAGAAACTGG TTACTTAACG ACGGCGAATA TATACTTAAC
CAACACTCAG ACGTTGCCAT AAAAAGGCAC GTCAAGGTAA AAGGCAATAA ATCCCCTTAT
GACGGTGATT GGACTTATTG GAGTAGTAGA ATCGGCAAAC ACCCAGGTGT AAGGAAAGAA
GTCACAACGC TGTTAAAACG GCAAAAGAAT AAATGCGCAT TTTGTGGACT AACCTTTAGA
TCAAATGACC TCATGGAAAT AGACCATATA AAACCAAAGT CTGAAGGCGG TGATAACTCA
ATTAAAAACA AGCAACTGTT ACACCGACAT TGCCACGATA CTAAAACTGC TTTAGATAAT
AAAACATACA CAAAACCTAA GTTACAGGAT TTACCTGATG AATATCTATG GGTAAATGAT
ATGTTAATTC TAAAACAGGG ATGTACCTAT GAAAAAGGAC GTTTAGGAGA GAAGCCGGAT
GAGGTGAAAG TCTCACGTCC GGTTTTGAAG ACGAGTCGGG TAAGGTAA
 
Protein sequence
MNKAEILKRK LDNPRPFSSV AWDTYDIPNQ VCVNPNLKWK DINWKKVEKY VFKLQKLIYR 
ASSRGEIRKM RKYQKLLTKS YYARLLAVRR VTQDNQGKKT AGIDGIKSLP PMQRLNLVEM
LGSRFLKASP TRRVWIPKPG REEKRPLGIP TMYDRALQAL VKLGMEPEWE ALFEPNSYGF
RPGRSTYDAI AAIYVSINHK PKYVLDADIS KCFDRINHDA LLGKIGKSPY RKLVKQWLKS
GVFDNKQFSN TVEGTPQGGV ISPLLANIAL HGMEKCLEDY AETLPGTKRD NQRALSLIRY
ADDFVILHKD IKVLLQAKTV IQEWLNQVGL ELKPEKTKIA HTLEEYEGNK PGFDFLGFTI
RQWKGKTTKQ GFKTLIKPSS KSIKTHYRKL ADIGDTYKTV PTKALIAKLN PVIRGWANYF
STVVSKEVYN KLDYLLWERL WRWASRRHPN KSAKWVKNKY FPRCKVTRNW LLNDGEYILN
QHSDVAIKRH VKVKGNKSPY DGDWTYWSSR IGKHPGVRKE VTTLLKRQKN KCAFCGLTFR
SNDLMEIDHI KPKSEGGDNS IKNKQLLHRH CHDTKTALDN KTYTKPKLQD LPDEYLWVND
MLILKQGCTY EKGRLGEKPD EVKVSRPVLK TSRVR