Gene Tery_3933 details


Gene Information

Locus tag: Tery_3933
Symbol:
ID: 4244016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6079121
End bp: 6080560
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 40%
IMG OID: 638108855
Product: hypothetical protein
Protein accession: YP_723437
Protein GI: 113477376
COG category:
COG ID:
TIGRFAM ID:
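
The coordinate and length fields in this record are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a complete CDS whose terminal stop codon is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed for Tery_3933.
length_bp = gene_length(6079121, 6080560)
print(length_bp, protein_length(length_bp))
```

For this record the sketch gives 1440 bp and 479 aa, matching the Gene Length and Protein Length fields.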


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.218593
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000403133
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGACCAAAA ATATTATAAA TACTATGACC ATAGATGCTG ATACCTTTTT TCAGGCAATA 
AACCCTAGGG TTCCTCTGTT TATAGATAAT GCGGAAATTG ACAAAAAATA CTATATTGAC
TTTTCTCCGG TGCGTGGGCA ACAGGTGATT AAGGATTTGA AGTCTACTAT TACTCGTTGG
TCTAAAGGTA AACCTACCTG TCAATTATTT ACTGGACATA TTGGTTGTGG TAAGTCTACA
GAACTCTGGC GACTGAAGCA ACAGTTGGAA ACAGCAGGTT ATCATGTTGT TTATTTTGAG
TCTGAAAAAA ACCTGGAAAT GGTAGATGTG GATGTTAGTG ATATTCTTTT GACTATTGCC
CAAAAGGTGA GTGAAAGTTT GGAGAAGTTA GAAAGATTAA ATTTGGAAGA GCCAAAAAGG
CTAAAAGGTC TACTTCAGAG TATTGTTAAA TTATTACAGA CAGAAATTGA ATTTTCGGCA
GAAACTACTG TTCCTGGTGT GGGTAAGTTG TCAGCTAGCA GTGATGGTTC ATTTTCGGCA
GATTTAGGAA TAGTTGAAGT GAAGGCTGAT GAGGAAGGGT TAGAGTTTGT GGCTTCAGGT
ATTGGTAAAA TTTCTGCACA GGCAAAGGGT AGCCCGGAAC TCCGCACTAA ACTGAGGGAA
TATCTGGGAC CTCGCACCCC TGGAATTATT GAGATGATCA ACAAAGAGTT GCTTGAACCT
GCTGATCAGA AGTTGAAAGA GTATGGTAAA AAGGGGTTGG TTGTGATCGC TGATAGTCTT
GATAAGGTTG ATAGTTCCCC AAAACCTTGG GGTAGAAATC AGCAGGAATA TTTGTTTGTA
GACCGAGGAG AGCAACTAAC AAGCCTTCAG TGTCATCTGA TTTATACTTT ACCTATAGCA
CTGCGTTTTT CTAATGACTA CGGTACTTTA ACTCAAAGGT TTGATGCTCC TAAGATATTG
CCGATGGTAG CTACACAGTT ACAGGATGGC AGTGAATGTA TTGCTGGAAT GGAGTTAATG
CGACAGTTGG TTTTGGCTAG GGCTTTTCCA GAGTTAACAC CACAGGAAAG GTTGGCAAGG
GTGACGGAAG TGTTTGATAG TCAGGAAACT TTAGACTATT TATGTTGGGT TAGTGGGGGT
CATGTCAGGA ATATGTTCCG AATGGTACTT GATGCTCTCA AGGAGGAGGA TGACTTACCT
ATTTCTCGTG GGAGTGTTGA CAATGTAGTG AGAAATTATC GCAATGAACA ACTTTTGGCT
ATAGATGATC ACGAGTGGGA GTTATTACGG CAGGTAGTTC AAACAAAAAA AGTAACGGGT
GATGACGGAT ACCAAATTTT AATCCGGAGC ATGTTTGTTT ATGAGTATCA ATATGACCAG
AGTTCTTGGT TTAATATTAA TCCTCTTTTG AAAGATGCAC CAGAATTGAA GATAAGTTAA
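
The gene sequence is printed in space-separated groups of ten bases, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch of one way to do that. The helper names are hypothetical, and the record does not say how its GC figure was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases among all bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGACCAAAA ATATTATAAA TACTATGACC ATAGATGCTG ATACCTTTTT TCAGGCAATA"
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to gc_fraction gives a value that can be compared against the GC content field.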
 
Protein sequence
MTKNIINTMT IDADTFFQAI NPRVPLFIDN AEIDKKYYID FSPVRGQQVI KDLKSTITRW 
SKGKPTCQLF TGHIGCGKST ELWRLKQQLE TAGYHVVYFE SEKNLEMVDV DVSDILLTIA
QKVSESLEKL ERLNLEEPKR LKGLLQSIVK LLQTEIEFSA ETTVPGVGKL SASSDGSFSA
DLGIVEVKAD EEGLEFVASG IGKISAQAKG SPELRTKLRE YLGPRTPGII EMINKELLEP
ADQKLKEYGK KGLVVIADSL DKVDSSPKPW GRNQQEYLFV DRGEQLTSLQ CHLIYTLPIA
LRFSNDYGTL TQRFDAPKIL PMVATQLQDG SECIAGMELM RQLVLARAFP ELTPQERLAR
VTEVFDSQET LDYLCWVSGG HVRNMFRMVL DALKEEDDLP ISRGSVDNVV RNYRNEQLLA
IDDHEWELLR QVVQTKKVTG DDGYQILIRS MFVYEYQYDQ SSWFNINPLL KDAPELKIS
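
The protein sequence follows from the gene sequence through the translation table given in the record (table 11). The following is a minimal sketch of that translation, assuming the sequence is given in coding orientation and starts with ATG. Table 11 shares the standard codon-to-residue assignments and differs in its permitted start codons, which this sketch does not handle. The names are hypothetical.

```python
from itertools import product

# Residues for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-residue assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACCAAAA ATATTATAAA TACTATGACC"))  # prints MTKNIINTMT
```

The ten residues printed match the first group of the protein sequence. Translating the full gene sequence the same way allows a check against the complete listing.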