Gene Tery_0447 details


Gene Information

Locus tag: Tery_0447
Symbol:
ID: 4242225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 705273
End bp: 706340
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 34%
IMG OID: 638105765
Product: hypothetical protein
Protein accession: YP_720379
Protein GI: 113474318
COG category: [R] General function prediction only
COG ID: [COG1611] Predicted Rossmann fold nucleotide-binding protein
TIGRFAM ID: [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
            [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2
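
The reported gene and protein lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(705273, 706340)
    print(length)                  # 1068
    print(protein_length(length))  # 355
```

Both values agree with the Gene Length and Protein Length fields of this record.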


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.101585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCAA ATCTTGACAA ATCTACAATT GAGTTCATTG AGCAAGAAGT AAGCAAATTA 
CTCAAAAATT TGCCCAACTT GAAACAGCAA AAATGGATAA GGCGATCGCT TTCCACTATA
GTTAATATCG CAGGAGAAGA ACTTGAGTAC CTGGACTGGA AGATATTAGC GGCCTCCTTA
CAAGACATGG AAAATGGATT TAAAACATTT TATCCCTATC GCCATATACC TAAAATTACT
ATATTTGGTT CTGCACGAGT ATCCTCAGAT ACCCCAGAGT ATAAAATGGC AGTAGAATTT
GCTCATCATA TGACTAAACA AGGTTTTATG GTAATGACCG GTGCAGGTCC TGGGATAATG
CAAGCTGGTA ATGAGGGAGC AGGAAGCAAC AAATCTTTCG GTCTCAATAT TCAGTTACCT
TTTGAGCAAG AGTCTAATCC ATTTATAAGA GGTAACAATA AATTAATTAA CTTTAAATAC
TTTTTTACTC GTAAATTATT TTTCCTGAAA GAAACTGATG CTATAGCTTT ATTTCCAGGT
GGTTTTGGTA CCCTAGATGA GGCATTTGAA ACTTTAACAC TAGTACAAAC AGGTAAATTT
GGCCCCGCTC CTGTAATATT AATAGATTAT CCTGGTGGAA ACTATTGGTA TGACTGGAAT
GATTTTATAA ATAAACAATT ACTCCTACGA GGTTTGATAA GTCCAAATGA TTATAATTTT
TACACTATTA CAGATAACTT AGAATCAGCT TATGAAGCGA TCGCCAATTT TTATAGAGTT
TATCATTCTA GTCGTTATGT AGGAGAAAAG TTTGTAATTC GTCTTAAATC AGAAATTTCA
GACAAAGATG TAGACTTTTT GAATCAAGAA TTTAGGGATA TTTTAACTGT AGGAAATATA
GAGAAAAGTC AAGTATTACC AGAAGAGTCC GCAGATAAAA CCTTTCACTT ACCACGTTTA
GTTATGCATT TTAACCACAA GGACATAGGC AGACTATATC AAATGATTGA TCAGATAAAT
AAAGTGGGAG GCAATATACA AAAATTCAGT CATTTAGGAA GTAAGTAA
 
Protein sequence
MASNLDKSTI EFIEQEVSKL LKNLPNLKQQ KWIRRSLSTI VNIAGEELEY LDWKILAASL 
QDMENGFKTF YPYRHIPKIT IFGSARVSSD TPEYKMAVEF AHHMTKQGFM VMTGAGPGIM
QAGNEGAGSN KSFGLNIQLP FEQESNPFIR GNNKLINFKY FFTRKLFFLK ETDAIALFPG
GFGTLDEAFE TLTLVQTGKF GPAPVILIDY PGGNYWYDWN DFINKQLLLR GLISPNDYNF
YTITDNLESA YEAIANFYRV YHSSRYVGEK FVIRLKSEIS DKDVDFLNQE FRDILTVGNI
EKSQVLPEES ADKTFHLPRL VMHFNHKDIG RLYQMIDQIN KVGGNIQKFS HLGSK
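
The sequences in this record are printed in space-separated blocks of 10 characters. A minimal sketch of how such a block could be cleaned up and its GC content computed is shown below. The helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, so its result is not the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, used for illustration only.
    fragment = clean_sequence("ATGGCCTCAA ATCTTGACAA")
    print(len(fragment))         # 20
    print(gc_percent(fragment))  # 40.0
```

Applying `clean_sequence` to the full gene sequence and then `gc_percent` would give a value to compare with the GC content field.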