Gene Tery_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1831 
Symbol 
ID4241907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2795958 
End bp2797253 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content42% 
IMG OID638106952 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_721560 
Protein GI113475499 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAAG AAATATTTAT GCCTGCCCTT AGTTCTACCA TGACCGAGGG TAAAATTGTT 
TCTTGGCAAA AAACTTCCGG TGATTGGGTA GAAAAGGGCG AAACAGTTGT GGTGGTTGAA
TCAGACAAAG CAGATATGGA TGTGGAATCC TTCTTTTCTG GATATTTGGC CACAATTATT
GTGGAAGCAG GAGATGTCGC TCCAGTTGGA TCTACTATTG GTTTGTTAGC TGAAACAGAG
GCAGAAATTG AGCAGGCAAA GCAACAAGGT GTGACTACTC TAAACAAAGA ACCTGCTAAT
ACTTCTAGCT CCACAACTCC TGTGGCCACA GCTCCTATTT CTACAGCCAC AGAAAACCAA
GAAAATTCTA GTCGTCGTAA TGGACGAATT ATTGCTTCCC CTCGTGCCCG GAAGTTGGCC
AAAGATTTGA AGGTCGATTT GAGTACCCTC AAAGGTAATG GTCCTCATGG TCGTATTGTG
GCAGAGGATG TGGAAATGGC GGCGGGTCGT ATACCTGCTG TGGTAGCTGC ATCAGCAAAG
AGTACTATTC CTACTACTCC AACTCAAGTA TCTATTCCTG CTCCACCACC ACCACCATCT
GTAGTGTCTG CTCCAGTAAC TCCTGGTCAA GTTGTGCCAA TGAATAGTCT GCAAAATGCA
GTGGTGCGGA ATATGAATGT AAGTTTATCT GTACCGACTT TCCATGTTGG TTATACTATA
ACTACAGATA ATTTGGATAG GTTGTACAAA CAAATTAAGT CTAAGGGTGT GACTATGACT
GCTATTTTGG CAAAAGCAGT GGCAATAACC TTACAAAAAC ATCCTTTGTT GAATGCTGTT
TATGTGGATC AGGGTATTCA GTATCCCTCT GGGATTAATA TTGCAGTGGC AGTAGCAATG
CCAGATGGTG GTTTGATTAC ACCAGTATTG CCAAATGCTG ACAAAATGGA TATTTATTCT
TTGTCTCGTA CTTGGAAAGG TTTGGTAGAT AGGGCGCGGG CAAAACAATT ACAAGCAAAT
GAATACAGCA CTGGTACCTT TACTATTTCT AATTTGGGAA TGTTTGGGGT AAATAGGTTT
GATGCTATTT TACCACCTGC TCAAGGTTCG ATTTTGGCGA TCGGCGCATC TCAACCCCAG
GTAGTGGCCA CTGATGATGG GATGATTGGG GTTAAGCGAC AAATGGAGGT GAATATTACT
TGTGACCATC GGATTATTTA TGGTGCTGAT GCTGCTGCTT TCTTGCAAGA TTTGGCGAAT
TTGATTGAAA ATAATTCACA GTCTTTGACT ATGTAG
 
Protein sequence
MIKEIFMPAL SSTMTEGKIV SWQKTSGDWV EKGETVVVVE SDKADMDVES FFSGYLATII 
VEAGDVAPVG STIGLLAETE AEIEQAKQQG VTTLNKEPAN TSSSTTPVAT APISTATENQ
ENSSRRNGRI IASPRARKLA KDLKVDLSTL KGNGPHGRIV AEDVEMAAGR IPAVVAASAK
STIPTTPTQV SIPAPPPPPS VVSAPVTPGQ VVPMNSLQNA VVRNMNVSLS VPTFHVGYTI
TTDNLDRLYK QIKSKGVTMT AILAKAVAIT LQKHPLLNAV YVDQGIQYPS GINIAVAVAM
PDGGLITPVL PNADKMDIYS LSRTWKGLVD RARAKQLQAN EYSTGTFTIS NLGMFGVNRF
DAILPPAQGS ILAIGASQPQ VVATDDGMIG VKRQMEVNIT CDHRIIYGAD AAAFLQDLAN
LIENNSQSLT M