Gene Franean1_6087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6087 
Symbol 
ID5674408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7409281 
End bp7410669 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content68% 
IMG OID641244939 
ProductNADH dehydrogenase subunit H 
Protein accessionYP_001510337 
Protein GI158317829 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0085721 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0109132 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGA CCTCGACGCT CCTGCTCGCC GCGAACGTCG GCGACCCGGA CATGTCCGTC 
CTCACCGACG ACCCGTTCTG GCTCATCCTG ATCAAGGCTG TCGCGGTCTT CGCGTTCCTG
CTGGTGATGA CGCTGTTCTC GATCGTCTTC GAGCGCAAGG TCGTCGCGAA GATGCAGCAG
CGGGTCGGGC CGAACCGGCA CGGCCCGCGC GGCTGGCTGC AGAGCCTCGC CGACGGCGTG
AAGCTCATGC TCAAGGAAGA CATCATCCCG ACGCTGGCCG ACAAGCCGAT CTTCATCCTG
GCTCCGGTCA TCTCGGCGGT GCCGGCCATC CTGGCCTTCG CCTGCATCCC GTTCGGGCCG
GAGGTGTCGA TCTTCGGTGA ACGCACGACG CTCCAGCTCG CCGACCTGCC GGTCAGCGTG
CTGTACCTGC TGGCGGCGGC GTCGCTGGGC GTGTACGGAC TGATCCTCGC CGGCTGGTCC
AGCGGCTCGA CCTACCCGCT GCTGGGCTCG CTGCGCTCGG CCGCGCAGAT CATCTCCTAC
GAAGTCGCGA TGGGCCTGGC CTTCGTCGCG GTCTTCGTCT ACGCGGGAAC ACTGTCGACC
GCGGGCATCG TGCAGAGCCA GCACGACTGG TGGTACATCG CGCTGCTGCC GTCGTTCATC
CTCTACTGCA TCGCGATGGT CGGTGAGACG AACCGGACGC CGTTCGACCT CCCCGAGGCC
GAGGGCGAGC TGGTCGGCGG GTTCCACACC GAGTACAGCT CGATCAAGTT CGCGTTCTTC
TTCCTCGCCG AGTACATCAA CATGGTCACC GTCTCGGCGA TCGCGACCAC ACTGTTCCTC
GGAGGCTGGC AGCCGCCGCC GATCCCGGGC CTGTCCGGGC TGGACCACGG CTGGTACCCG
CTGATCTGGT TCGTCATCAA GCTGCTGCTG TTCATCTTCG TGTTCATCTG GCTGCGGGGC
ACGCTGCCGC GGCTGCGCTA CGACCAGTTC ATGGCCTTCG GCTGGAAGGT CCTGATCCCG
GTCGGCCTGG TGTGGGTGCT CGCGGTCGCG ACCTTCCGCG CCTACCAGGA GCACGTGAGT
GACCGGACGC CATGGCTGAT CGGCTTCGGG GTCGTGGTGG GCGTCCTGCT CGTGGTTGCG
ATCATCGACC CGGGCGCGGC GAGGGCCCAG CGCGAGCAGG AGGAGGCCGA GCGGGAACGC
GCCGAATCGG CCCCGAGCCT GGACAGGATT CCCTGGCCGC CGCCGTCCGA CGGCGCGACG
AGGGCACTGG CCGGCCGCAC TGCGGCTGGC AGCACTGCAG CCGGCAGCGG CGCGGCCGGC
TCCGCCGGCG GCGACAAGGG AAACACCACC GTCATCCCCG CGGGCTCCGG CCCGCGACAG
GAGAGCTGA
 
Protein sequence
MTGTSTLLLA ANVGDPDMSV LTDDPFWLIL IKAVAVFAFL LVMTLFSIVF ERKVVAKMQQ 
RVGPNRHGPR GWLQSLADGV KLMLKEDIIP TLADKPIFIL APVISAVPAI LAFACIPFGP
EVSIFGERTT LQLADLPVSV LYLLAAASLG VYGLILAGWS SGSTYPLLGS LRSAAQIISY
EVAMGLAFVA VFVYAGTLST AGIVQSQHDW WYIALLPSFI LYCIAMVGET NRTPFDLPEA
EGELVGGFHT EYSSIKFAFF FLAEYINMVT VSAIATTLFL GGWQPPPIPG LSGLDHGWYP
LIWFVIKLLL FIFVFIWLRG TLPRLRYDQF MAFGWKVLIP VGLVWVLAVA TFRAYQEHVS
DRTPWLIGFG VVVGVLLVVA IIDPGAARAQ REQEEAERER AESAPSLDRI PWPPPSDGAT
RALAGRTAAG STAAGSGAAG SAGGDKGNTT VIPAGSGPRQ ES