Gene Tery_4212 details


Gene Information

Locus tag: Tery_4212
Symbol:
ID: 4245864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6496894
End bp: 6497961
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 31%
IMG OID: 638109109
Product: hypothetical protein
Protein accession: YP_723687
Protein GI: 113477626
COG category:
COG ID:
TIGRFAM ID:
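
The start, end, and length fields above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the record states neither convention explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Tery_4212.
length = gene_length_bp(6496894, 6497961)
print(length, protein_length_aa(length))
```

Under those assumptions the sketch prints 1068 and 355, which agrees with the Gene Length and Protein Length fields.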


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.882306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTAC TTATAGCCGA GACTGTTACA ACACATTTGA CCAAAACTAT GGATATAATT 
TTTATTGATA CTTCAATTTT TGAATCAAAC AACTTTTTGG AAAGCGATAG GATTAAAGAA
GTGTATAAGC TAGCTGAAAG AGGTGAAATA AAGGTTGTTC TTCCAGAATT GACTTATGAT
GAAATCTTAA ACCGAATATC AAAGAATATT GAGGAGTCTA GTCAAAAATT CAACAAATAT
CGACAGGACA CTCGAATTCT TAGGAATATA CCTTCCCTCT CTGAAAAATT TAAACCATTT
GAAAAAAGAC AAGTTAAAGA AGAACTATAC GAAATCGTAA AAGAAAAATT TTCGCAATCT
AACTTCGAAA TTATTGCTTA CCCCACATTA AACATTAAAG AGATTTTTAG GAGTTATTTT
GATAAAACAT TTCCTTTTGG ATCGGGAGGA AAAAAAAGTG AATTCCCTGA TGCTTTTACT
CTGAAATCCA TAGAGATTTG GGCAGAGGAA AACAACGTTA AAGTTTTAGC ATTCTCAAAG
GACAAGGATA TGCTAAAGTA CACGAGCGAG CATTTAGAAA TAATCGAAGA TTTCAACAAG
TATTTAAGTG ACAAAATAAA AGAAATAGAA GTTGCTTCAA ACAAAAAACG TCTTCGTCTT
GACCAAGTAG AAGACATTAT TCAAAACAGA CCCGAAAGGA CTCAAAAAGA AATTAAAGAA
TGGGTTGAGA ATCAGCTTGA TAATTATTCA AAATATTATG ACTATTCGAA CCAATGTGAA
GTTCACGATG TGTCAATTAT AGAAGTGGAA ACTAATATTG AAGATTATAC CATAACTAAT
ATTTCTAAAG ACTACATATC AGTTGAGTTG AGAGTACGGA TAAACTATCA AGTTCAAATA
ATAATTGATG ATGAGGACTC TATATATAAA GACGATGATA CTAAAGAATT GTTTTTCCGA
GAAACCAAGC CAGATTTAGT GGGCGACATA ATAGATATTG ATGTTGATTT AAGATTTTAC
TTTGAACCTG ATGATGACAC TGTTTACACT TACCAACCCC TAATCTAA
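
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks need to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the reported GC content. The helper names are hypothetical, and how the database itself rounds or computes the 31% figure is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the display spaces and line breaks, and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
example = "ATGCAACTAC TTATAGCCGA"
print(len(clean_sequence(example)), gc_fraction(example))
```

Passing the full gene sequence to `gc_fraction` in place of the two-block example gives the value for the whole gene.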
 
Protein sequence
MQLLIAETVT THLTKTMDII FIDTSIFESN NFLESDRIKE VYKLAERGEI KVVLPELTYD 
EILNRISKNI EESSQKFNKY RQDTRILRNI PSLSEKFKPF EKRQVKEELY EIVKEKFSQS
NFEIIAYPTL NIKEIFRSYF DKTFPFGSGG KKSEFPDAFT LKSIEIWAEE NNVKVLAFSK
DKDMLKYTSE HLEIIEDFNK YLSDKIKEIE VASNKKRLRL DQVEDIIQNR PERTQKEIKE
WVENQLDNYS KYYDYSNQCE VHDVSIIEVE TNIEDYTITN ISKDYISVEL RVRINYQVQI
IIDDEDSIYK DDDTKELFFR ETKPDLVGDI IDIDVDLRFY FEPDDDTVYT YQPLI
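
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own procedure. It assumes that translation table 11 assigns the same amino acid to every codon as the standard code and differs only in which codons may act as starts; because this gene begins with ATG, no special start handling is included. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order ("*" marks a stop).
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence up to, but excluding, the first stop."""
    seq = "".join(sequence.split()).upper()  # drop display spaces and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGCAACTAC TTATAGCCGA GACTGTTACA"))
```

For these three blocks the sketch prints MQLLIAETVT, the first ten residues of the protein sequence above.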