Gene Franean1_3401 details

Gene Information

Locus tag: Franean1_3401
Symbol:
ID: 5671772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4032563
End bp: 4033651
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 73%
IMG OID: 641242289
Product: hydrogenase expression/formation protein HypD
Protein accession: YP_001507709
Protein GI: 158315201
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0409] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00075] hydrogenase expression/formation protein HypD
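
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop else codons


length = gene_length(4032563, 4033651)
print(length, protein_length(length))
```

With the values listed for this gene, the sketch gives 1089 bp and 362 aa, matching the Gene Length and Protein Length fields.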


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATACC TCGACGAGTA CCGCGACCCC GCCCTCGCGC GGCAGCTCCT CGACGAGATC 
CACACGACGG CCACCCGGCC CTGGACCCTG ATGGAGGTCT GCGGCGGCCA GACGCACACC
ATCGTCCGCC AGGGCATCGA CAACCTGCTG CCGGCGGGCC TGCGGATGAT CCACGGGCCG
GGCTGCCCGG TGTGCGTGAC CCCGCTCGAA CTCATCGACA AGGCCCTGGC CATCGCCGCC
CGTCCCGAGG TCATCTTCAC CTCCTACGGC GACATGCTGC GGGTGCCCGG AACGGGGACC
GACCTGTTGG CCCTGCGCGC CCGCGGCTCC GACGTCCGCG TCGTCTACTC CCCGCTGGAC
GCGGTACGCC TGGCCGAACA GCACCCGGAC CGCCAGGTGG TCTTCTTCGC GGTCGGCTTC
GAGACGACCG CGCCGGCGAA CGCCATGGCG GTGCTGCGCG CCCACCAGCT CGGCCTGCCC
AACTTCAGCA TCCTGGTCAG CCACGTCCTC GTCCCACCGG CTATGACGGC GCTCCTCGAC
GCGCCCGACC GCCAGGTCCA GGGGTTCCTC GCCGCCGGCC ACGTCTGCGC CGTCATGGGC
TGGACGGAGT ACGAGCCCAT CGCGCACCGT TACCAGGTGC CCGTCGTCGT GACCGGCTTC
GAGCCGCTCG ACCTGCTCGA GGGCATCCTG ATGGCCGTCC GCCAGCTCGA GGCGGGCCAC
GCGCGGGTGG AGAACCAGTA CGCCCGCGCC GTCCACCGCG ACGGCAACAG CCGGGCGCGG
GAGGCCATCC GCCGCGTGTT CCGGGTGCGG GACCGCGCCT GGCGCGGCAT CGGCACCATC
CCGGACAGCG GCCTGGCCCT CACCGACGAG TTCGCCCGCT ACGACGCCGA GACCCGCTTC
GCCGTCTCCG GGCTGACCGC CCGGGAGCAT CCCGCCTGCA TCGCCGGCGA CATCCTCACC
GGCGCCCGCG AGCCGACCGA CTGCACCGCC TATGGGACGG CCTGCACGCC ACGCACCCCG
CTCGGCGCGC CGATGGTCTC CACCGAGGGC ACCTGCGCCG CCTACCACTC CGCCGGGAGG
GCGTCGTGA
 
Protein sequence
MRYLDEYRDP ALARQLLDEI HTTATRPWTL MEVCGGQTHT IVRQGIDNLL PAGLRMIHGP 
GCPVCVTPLE LIDKALAIAA RPEVIFTSYG DMLRVPGTGT DLLALRARGS DVRVVYSPLD
AVRLAEQHPD RQVVFFAVGF ETTAPANAMA VLRAHQLGLP NFSILVSHVL VPPAMTALLD
APDRQVQGFL AAGHVCAVMG WTEYEPIAHR YQVPVVVTGF EPLDLLEGIL MAVRQLEAGH
ARVENQYARA VHRDGNSRAR EAIRRVFRVR DRAWRGIGTI PDSGLALTDE FARYDAETRF
AVSGLTAREH PACIAGDILT GAREPTDCTA YGTACTPRTP LGAPMVSTEG TCAAYHSAGR
AS
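
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the sequences are pasted in as plain strings with the 10-character spacing shown on this page. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as start codons). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above
first_line = "ATGAGATACC TCGACGAGTA CCGCGACCCC GCCCTCGCGC GGCAGCTCCT CGACGAGATC"
print(translate(first_line))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm the two listings agree; `gc_content` on the full gene sequence can be compared with the GC content field in the same way.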