Gene Francci3_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1077 
Symbol 
ID3906420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1283031 
End bp1284002 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content69% 
IMG OID637878411 
Producthydrogenase (NiFe) small subunit (hydA) 
Protein accessionYP_480188 
Protein GI86739788 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.24331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCCG TGCTGTGGTT CCAGGGCGGG GCGTGCAGCG GCAACACGAT GTCCTTCCTG 
AACGCCGACG AGCCCAACGT CGTCGACCTA ATCACCGACT TCGGGCTGGA ACTGCTCTGG
CACCCGTCGT TGGGGCTGGA GAACGGCGCC CAGGCCCGGG AGCTGTTCAC CGACTGCGCG
CGGGGTGAGC GGCCGGTCGA CATCTTCGTC TTCGAGGGTT CGGTGATCCG CGGCCCGAAC
GGAACCGGCG GCTTCGACGT CTTCGCCGAG CGGCCCATGC AGGACTGGGT GCGCGAGCTG
GCCGCCCGGG CCCAGGTGGT GGTGGCGATC GGGGACTGCG CGTGCTGGGG CGGGATCCCC
GCGACGGCGC CGAACCCGAC GGACTCCACC GGGCTGCAGT TCCACAAGCG TGAACGCGGC
GGTTTCCTCG GCCCGGACTT CCGATCGCGC TCCGGGCTGC CCGTCGTCAA CATTCCGGGC
TGCCCGGCCC ACCCGGACTG GATCACGCAG ATCATCGTGG CGCTGGCCAC CGGCCGGGCC
GCCGACATCG CGCTCGACGA GCTGCACCGG CCACGGACGT TCTTCACGAC ATTCACCCAG
ACCGGCTGCA CCAGGGCGGA GTACTTCGAA TACAAGCAGT CGACCCTGGC TTTCGGGGAC
GGGACCCGCA AGGGCTGCCT GTTCTACGAG TTCGGCTGCC GCGGCCCGAT GACCCACTCC
CCGTGCAACC GGATCCTGTG GAACCGCCAG TCGTCGAAGA CCCGCGCCGG CATGCCGTGC
ATCGGGTGCA CGGAGCCGGA GTTCCCATTC TTCGACCTCG CCCCCGGCAC GATCTTCAGG
ACCCGGAAGG TCGGCGGGCT CATTCCGCGG GAGGTGCCGG CGGGCAGCGG CCACCTCGGC
TACCTGGCAC ACGCGGCAGC TGCCCGGATA GTCGCCCCGC AGTGGTCGAA GGAGGACATG
TTCGTCGTCT AG
 
Protein sequence
MSSVLWFQGG ACSGNTMSFL NADEPNVVDL ITDFGLELLW HPSLGLENGA QARELFTDCA 
RGERPVDIFV FEGSVIRGPN GTGGFDVFAE RPMQDWVREL AARAQVVVAI GDCACWGGIP
ATAPNPTDST GLQFHKRERG GFLGPDFRSR SGLPVVNIPG CPAHPDWITQ IIVALATGRA
ADIALDELHR PRTFFTTFTQ TGCTRAEYFE YKQSTLAFGD GTRKGCLFYE FGCRGPMTHS
PCNRILWNRQ SSKTRAGMPC IGCTEPEFPF FDLAPGTIFR TRKVGGLIPR EVPAGSGHLG
YLAHAAAARI VAPQWSKEDM FVV