Gene Tery_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3014 
Symbol 
ID4244898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4661640 
End bp4662857 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content37% 
IMG OID638108049 
Product4Fe-4S ferredoxin, iron-sulfur binding 
Protein accessionYP_722642 
Protein GI113476581 
COG category[C] Energy production and conversion 
COG ID[COG1142] Fe-S-cluster-containing hydrogenase components 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00920835 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGAGAAC GCTACTATCC TTTAAATTCC TTAAAAGAAG GCCACTGGTT TAAATTAATC 
TGTGGGGCAA GCTTTCAACA CCTGCCAACG GTGAGAAATT TAACATTAGC TTATACTTTG
GCGGGTGCAG ATTGTATTGA TGTGGCTGCA GACCCTGCGG CAATTGCTGC GACGAAAGAG
GCTGTAGAGG TCGCATCTCA GCTTAAGTCT TGGGCTAAAA ATCATAAATT TGGCTACCAA
GCAAGGCCAT TAATAATGGC CAGCATAAAT GATGGAGAAG ATCCTCATTT TCGGAAAGCT
GAATTTGACT CTACCATTTG TCCTATAGAA TGTTGGCGGC CTTGTGAGAA GGTATGTCCT
GCAGAAGCTA TAGTTTTCTT TGAAAAAGAT AATGCTATAA ATCTTGGTAA TTCAGGAGTT
ATAGATGAAC TTTGCTATGG TTGTGGTAGG TGTTTATCTG TATGTCCGAA TCAACTAATA
CAAGCTCGCT CTTATGTCTC TACTCCTAGT TCTATTGCAT CATTAATATT GCAGACTGGT
GTAGATGCAA TAGAAATTCA CACTCAAGTG GGAAGAGAAA CTGATTTTCA GCGACTCTGG
AAAAGTATAA AGCCTTGGGT CAATCAATTA AAACTAATTG CTATTAGTTG TCAAGACGAC
GAGGGGTTAA TAGAATATTT ACAGAGTCTA TATAAAATTA TTTCTCCTCT TCCTTGTAGC
CTAATTTGGC AAACTGATGG TAAGCCCATG AGTGGGGATA TTGGTGTAGG TACTACCGCA
GCTACAATTA AATTGAGTCA AAAAGTGCTA GACGCAGGAT TACCGGGGTA TGTACAACTA
GCAGGTGGTA CAAATAATTC TACCGTAGTC AAATTAATTG CTCTGGGTTT GCTAGCGAAT
AAGAGTCTGA AACAAGATAA AGATTACATC AATAATTTCA ATATTCCCAT TTCAGAAAAT
AAATTTGTTG CTGGTGTAGC TTATGGTAGC TATGCCCGTA TTCTATTATC ACCAATCTTG
GAAAAATTAG AAAAAATGCA TAAATCACAA CTAGAGGCAA TTGAATTTGC TTACGATACA
GCTAGTGTCA ATACAAAAGA TGTTAATCAA AACCAGAGTA AAAAGTCTCA ATTAACAAAG
CTAGAAACTA TACCAGAAAT TCTTTGGGAA GCTGTTAGTT TAGCTAATTC TTTGGTTTCC
CAAATTAAAA GATTGTAG
 
Protein sequence
MRERYYPLNS LKEGHWFKLI CGASFQHLPT VRNLTLAYTL AGADCIDVAA DPAAIAATKE 
AVEVASQLKS WAKNHKFGYQ ARPLIMASIN DGEDPHFRKA EFDSTICPIE CWRPCEKVCP
AEAIVFFEKD NAINLGNSGV IDELCYGCGR CLSVCPNQLI QARSYVSTPS SIASLILQTG
VDAIEIHTQV GRETDFQRLW KSIKPWVNQL KLIAISCQDD EGLIEYLQSL YKIISPLPCS
LIWQTDGKPM SGDIGVGTTA ATIKLSQKVL DAGLPGYVQL AGGTNNSTVV KLIALGLLAN
KSLKQDKDYI NNFNIPISEN KFVAGVAYGS YARILLSPIL EKLEKMHKSQ LEAIEFAYDT
ASVNTKDVNQ NQSKKSQLTK LETIPEILWE AVSLANSLVS QIKRL