Gene Tery_4337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4337 
Symbol 
ID4245989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6685688 
End bp6687259 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content37% 
IMG OID638109224 
Producthypothetical protein 
Protein accessionYP_723802 
Protein GI113477741 
COG category 
COG ID 
TIGRFAM ID[TIGR03605] SagB-type dehydrogenase domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.519761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTTC CTTGTAGGAA AGTAGTGGTC TACTCGCAAC ATTTAGATAA AAAGATGGCA 
GAAGTTAAAC AATCTATCGC TCAATATTAT CATGAACAGA CAAAATATGA CCCAGAAACC
ATTGGGAGGA AAAATCACCA ACTAGACTGG GAAAATCAAC CAATACAATT TAAAGAGTAT
AAATTTGGGA CAACTATTGA CCTGAAACCT TATCTACAAG AAAAAATTCA ACAAAAAGAT
CCTAAAGAAG AAGAAAATCT ATCCAACTTA CTTGTATGTA CTTATGGCTT AACCGGTAAG
ATACCAACAA TGATGGGGAA TCATTACTTA AGGGCGGCTC CTTCTGCTGG TGGGTTGTAC
CCAGCAGAGG TTTACTTAAT TTCCCGTGGC ACACCCCTAT TACCTACAGG ATTATATAAC
TATCAGGCCA AAACTCATTC CTTGCTCAAA TTTTGGGACA ATAGAGTCTG GCCAGAATTG
CAGGATGCTT GTTTTTGGCA TCCTGCTCTA GAAAATACAA AATTAGCGAT CGCCATCACT
GCCATATTCT ATCGTTCTGC TTGGCGTTAC GAAGATCGGG CTTATAGACG AATATTTTTA
GATGCAGGTC ACTTACTAGG TAACGTTGAA CTAGCTTCTA GTTTCAGTAA CTACCGATCA
CATCTAATTG GGGGGTTTAA TGATGAAGCA ATAAATCAAA TGCTTTATTT AGATAATCAG
GAGGAACAGG CGATCGCCAT TGTCGCCCTT GCAGACTTAT TAGATATTGA ACAACATCTG
CCCCACTGGC AAGCTGCTTT ACCTTCTGGG ACTCAAACAG ATTATCCGAA AATTCAAGAG
GGTGACCTCT TAAAATATTT ACATAAAGCT ACTCATATTA ACTCAAATTC TCCAAAAGTA
TCTGCTTGGA AAACAAGAGC AACCCTAGAA CATTCAGAAG ATAAATATAA CTTTCCATTT
TGTACAAAAA TTTCAACTGT AACTCCTCCC ATTTCATGGG GTCAAGACTT ACAAGGTCTC
AGAAAGACAA TTCTCAAACG ACGTTCAACT CGCAGCTATT CGGGAGAAAC TATTGCTTTG
AATAAACTCT TGGCTGTCTT AGATTTTACT TATCAACCCC ACCATTACAT TAATCAAGAA
TTAGATCGTA ATCCTGATTA TTTTGATATA AGTTTGATTG AGACTTTTAT TGCAGTTTCT
GGAGTAAATG ACTTAGAAGA TGGTTGTTAT TATTATGCCC CAATAGCTCA AGAACTACGA
CAAATTAGGT TTAAAAATTT CCGAAAAGAA TTACATTTTA TGTGCTTAGG GCAAGACTTA
GGTAGAGATG CTGCTGCTGT TTTATTTCAT ACTGCCGACT TAAAAAAAGC TGTAAGCAAA
TATGGCGATC GCGTTTACCG TTATCTCCAT TCAGACGCAG GCCATCTAGG GCAAAGACTA
AATTTAGCAG CTATTTATCT AGGTTTAGGT GTTAGTGGTA TCGGCGGTTT CTTTGACGAT
CATGTGAACC AAGTTTTAGG TATTCCTGTA GATGAAGCAG TTATTTATAT AACAACTTTA
GGTTCTATTT AA
 
Protein sequence
MTLPCRKVVV YSQHLDKKMA EVKQSIAQYY HEQTKYDPET IGRKNHQLDW ENQPIQFKEY 
KFGTTIDLKP YLQEKIQQKD PKEEENLSNL LVCTYGLTGK IPTMMGNHYL RAAPSAGGLY
PAEVYLISRG TPLLPTGLYN YQAKTHSLLK FWDNRVWPEL QDACFWHPAL ENTKLAIAIT
AIFYRSAWRY EDRAYRRIFL DAGHLLGNVE LASSFSNYRS HLIGGFNDEA INQMLYLDNQ
EEQAIAIVAL ADLLDIEQHL PHWQAALPSG TQTDYPKIQE GDLLKYLHKA THINSNSPKV
SAWKTRATLE HSEDKYNFPF CTKISTVTPP ISWGQDLQGL RKTILKRRST RSYSGETIAL
NKLLAVLDFT YQPHHYINQE LDRNPDYFDI SLIETFIAVS GVNDLEDGCY YYAPIAQELR
QIRFKNFRKE LHFMCLGQDL GRDAAAVLFH TADLKKAVSK YGDRVYRYLH SDAGHLGQRL
NLAAIYLGLG VSGIGGFFDD HVNQVLGIPV DEAVIYITTL GSI