Gene Plut_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_1446 
Symbol 
ID3744959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp1629697 
End bp1630785 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content53% 
IMG OID637769484 
ProductNi-Fe hydrogenase, small subunit 
Protein accessionYP_375348 
Protein GI78187305 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.123847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTGTA AACAGACCTT TGAAGAAGTA TTGAACGAAA GGGGGATCAG CCGCAGGAGC 
TTCCTGAAAT ATTGTGCACT TACGGCTGCC GCTCTCGGGC TCTCTCCGCT TATGGCCTCA
AAAATCGCCC ACGCCATAGA AACCGGACCA AGGACACCCG TTCTCTGGCT TCATGGTCTT
GAATGCACAT GTTGCTCAGA ATCCTTCATC CGCTCCTCCC ATCCCACCAT TGAAGACATC
CTCTTCAACA TGATTTCTCT CGATTACGAT GATATCCTGA GTGCCGCGGC TGGTACCCAG
CTGGAGGAGG TGCGAAGAAG GATCATGAAA GAATACAAGG GAAAGTACAT CCTTGCCATC
GAGGGCAACA TTCCGACAAA GGATGACGGT GTTTACTGTC TGGTAGGCGG GGACTCCTTC
CTCAACACCG TAAAAGAGAC CGCAGCTGAT GCGGCCGCCA TCATCGCCTG GGGAAACTGC
GCCTCGTTCG GCTGTGTACA GAATGCCCAT CCCAATCCAA CAGGAGCGGC TCCTGTTTCA
GACATCATAA AAAACAAGCC TATTGTCAAA GTACCGGGAT GTCCGCCCAT AGCAGAGGTG
ATGACCGGTG TTATAGCACA CTTCCACACT TTCGGAACGC TCCCGGAACT GGATCGCCTC
GGCCGCCCAA AAGCATTCTA CAACACCCGC ATACACGACA AATGCTATCG TCGCGCGTTT
TTCGATGCAG GAATGTTCGT GGAAAGCTTT GATGACGAAG CAACAAAAAA AGGCTGGTGC
CTCTACAAGA TGGGGTGCAA GGGGCCAACG ACCTACAACT CATGCTCAAA AATCCAGTGG
AACGGTGGCA CCAGCTTCCC CATCGGCTCC GGGCACCCCT GCATCGGATG TTCCGAGCCA
GGTTTCTGGG ATAATGGGCC CTTCTATGGA AGGCTCGCAA AAGTACCATT CCTCGGCAGT
GACAGCAATG CAGACAAGGT TGGCATAGTC GCCGTTGGGG CCGCAGCTGC CGGCGCAGCG
GCGCATGCGA CCGTTACAGC CCTGAAAAAG GCAAAACAAG GCGGTGAAGA AAATAAAGAT
AATGCTTAA
 
Protein sequence
MQCKQTFEEV LNERGISRRS FLKYCALTAA ALGLSPLMAS KIAHAIETGP RTPVLWLHGL 
ECTCCSESFI RSSHPTIEDI LFNMISLDYD DILSAAAGTQ LEEVRRRIMK EYKGKYILAI
EGNIPTKDDG VYCLVGGDSF LNTVKETAAD AAAIIAWGNC ASFGCVQNAH PNPTGAAPVS
DIIKNKPIVK VPGCPPIAEV MTGVIAHFHT FGTLPELDRL GRPKAFYNTR IHDKCYRRAF
FDAGMFVESF DDEATKKGWC LYKMGCKGPT TYNSCSKIQW NGGTSFPIGS GHPCIGCSEP
GFWDNGPFYG RLAKVPFLGS DSNADKVGIV AVGAAAAGAA AHATVTALKK AKQGGEENKD
NA