Gene Franean1_1491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1491 
Symbol 
ID5669895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1787674 
End bp1791786 
Gene Length4113 bp 
Protein Length1370 aa 
Translation table11 
GC content70% 
IMG OID641240411 
Productmolybdopterin oxidoreductase 
Protein accessionYP_001505837 
Protein GI158313329 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein)
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.362734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACGA AGGTCGCGCT GGACCGGGTC CGGACCGTGT GCGCGTACTG CGGTGTCGGC 
TGCGGGATCG TGCTCGAGGT CGACCGCAGC GGGGACAGCC CGCGGGTTGT CCGTTCCACG
GGGGATCCTG ATCATCCGGC CAACCGTGGC CGGCTGTGCA CCAAGGGTGC GACCACCGCC
GAGCTGCTCA ACGCGCCTGG ACGCCTCACC ACCGCGCTGG CCCGGGCGGG CCACGACGAG
GAGCCGGCCC CGATAGACGT CGACACGGCG CTCGACCTCG CCGCCGCCCG CCTCGCGGCG
CTGCGCGACG AACACGGCCC GGACGCCATC GGCATCTACA CGTCCGGCCA GCTGAGCATC
GAGGCGCAAT ACCTGATCAC CAAGCTCGCC AAGGGCTTCC TCCGTACCCA GTACCAGGAG
TGCAACAGCC GGCTGTGCAT GGCCAGCGCC GCCTCGGGCT ACAAGCTCAG CCTCGGAGCG
GACGCTCCCC CCGGCAGCTA CGACGACTTC GACCACGCCG ACGTGTTCCT CGTCATCGGC
GCGAACATGG CCGACTGCCA CCCGATCCTG TTCCTGCGGA TGATGGATCG CGTCAAGGCC
GGCGCGAAGC TCATCGTGGT CGACCCCCGC CGGACGGCGA CGGCGGACAA GGCCGATCTG
TTCCTCCAGC TCCGACCGGG CACGGACATC GCACTGCTCA ACGGACTGCT TCACCTGCTG
CACGCCTCCG GCGCGGTCGA CGAGTCGTTC GTCGCCGCCC ACACCGAGGG CTGGGCCGCC
ATGCCCGACC TCCTCGCCGG CTACGACCCG GCCACCGTCG CCGACATCAC CGGCGTCCCG
GAGGACGACC TACGCGCCGC CGCGGCACTC ATCGCGGCCG CCGGGAGCTG GATGAGCTGC
TGGACGATGG GACTCAACCA GTCCACCCAC GGCACCTGGA ACACCAACGC TCTGATCAAC
CTCCACCTCG CGACCGGCGC GATCTGCCGC ACCGGGAGCG GGCCGTTCTC GCTCACCGGA
CAGCCCAACG CGATGGGCGG CCGCGAGATG GGTTACATGG GGCCGGGCCT ACCCGGGCAG
AGGTCGCTCC TGGACCCCGC CGACCGCGCA CACATCGAGC AGCTCTGGGG CCTGCCGGCG
GACACGATCC GCGCGGACCA CGGCAACGGC ACGATCGGAA TGTTCGAGCA GATGTCAGCC
GGGGCGATCA AGGCGGCGTG GATCATCTGC ACCAACCCGG TCGCCACCGT CGCCAACCGT
CGCACCGTCA TCGAAGGTCT GGAGACGGCC GAGTTCGTCC TCGTCCACGA GGCCTTCACC
GAAACGGAGA CCACCGGCTA CGCCGACGTC GTCCTGCCCG CGGCCGTCTG GGCGGAGACG
GACGGCGTGA TGGTGAACTC CGAACGGAAT CTCACCCTCA CCCGACCCGC GGCAGACCCA
CCCGGCCAGG CCCGCGCCGA CTGGCAGCTC ATCGCGGGAA TCGCCACCCG CCTGGGATAT
GCCGACCACT TCACCTACAA CAGTGCGCAG GAAGTGTTCG ACGAGCTACG TCAGGCGTGG
AATCCGGTGA CCGGATGGGA CCTACGCGGC ATCACCCACG GCAGACTGCG GGACGGGCCG
ATGCAGTGGC CCGCACCGCC AGGCGCCGGC GCTCGCAACC CGATCCGTTA CATGACCGAC
GACGGCCCAC GGTTCCCGAC TCCGTCCGGG AGGGCGAAGT TCTGGCCGCG GCCGCACGTC
GACCCGCAGG AGATACCCGA CGAGAAGTAC CCATATCTTC TCAACACGGG CCGCCTCCAG
CACCAGTGGC ACACCCTGAC CAAGACCGGA CGCGTCGGGC GGCTCAACAA GCTCAACCCG
GCTCCATTCA TCGAGATTCA CCCGGACGAC GCCGCGGCAC TCGGCATCTC CGAAGGTGAC
CAGGTGGAGG TCTCCTCGCG GCGAGGGCGC GCCGTCCTTC CCGCCGCCGT CACCGACCGG
GTCCAGCCGG GCAACTGCTT CGCCCCGTTC CACTGGAACG ACATGTTCGG CGAGTACCTC
GCGGTGAACG CCACGACATC CGACGCGGTC GACCCTGTCT CCCTGCAGCC CGGGTTCAAG
GCCAGCGCCG TCGCGCTCAC GCGGGTCCAC ACGCCCACCC CGGCGCAGCC GCCGCCCACC
CTCGTCGACG ACGTACTTCA CGCGGACGCC GTGGCTTCCG CCGCGCCCAT CGCGAGCCTG
CTCGGGCTCG GTGGGACAGC CGCCCCCCCG GCACTGAACG ACGACGAAAG CGTCTATCTG
GCCGGCTTCG TGGCGGGCCT GGGCCTCGAT CTCGCGCGCG GAGCGCTCAC CTCGGTACCC
ACGCTCCCGC CCTCGGCACC AGTCACCGCG CGGACGGCCC TGTGGGTGAA CGGCGCACTG
GCCGGCCTCT GGTCACGCCT TCCGGCCGAA GGACGCGAGA TCTTGCCAGG GCGCGGGGTC
TCGCCGCGCG AGGACCCGCC GGGGACGGGC GGGCGCCGGG TGCTGCTCGG CTGGGCCTCC
CAGACCGGCA ACGCGGAACG GGTCGCCGAA CAGGCCGAGG CGAGGCTGAC CGAGTCCGGC
ACCGTCGTCA CCACGCTGCC CCTCAACGAG ATCGACCACA TCGCCCCGCA CACCGACCTC
GTGGTCGTCA CGAGCACCTT CGGCGACGGG GAGTCCCCCG ACAACGGCTC CGAATTCCTG
GCGAGGCTGC GAGCTCGCTC GATGCCGCTC GACGACGTCC GCTTCAGCGT GATCGCCCTC
GGCGACAGCA GCTACAGCGA CTTCTGCGGC CACGGTCGGC GCATCGACGA GCTGCTCGCC
GAACTCGGCG GACTGCGCCT GGCCGACCGG GTCGACTGCG AGCCGGATTT CGACGAGGCG
GCCCGGGCCT GGCTCGACCG GGTGCTTGCT GCTCGCGCAG GGGCCTCCCC CACCAGACCC
GAGCCTGTGC CTGTGCCTGT GCCGCCGGTG TCGCGACTCC CGGGCGCGTC CTCGGGCCCG
CGGGTGAGTG AGACACACCG GGTGATCCTG GCGGGCAACC GACTGCTCAG CGGCGTGGGT
TCAGCCAAGG AGGTTCGTGA GTTCCTGATC GACCTGGAGG ACAGCGACCT CACTTACGAG
ACCGGTGACG TCCTCGTCGT GCGGCCCAGC AATCGCGCCG AGCTGGTCGA CGAATGGCTC
AGCACGACAG GTCTGCGCGG CGACACGATC GTCGAACTGC TCCAGTTCGG CCGGACGGAG
CTCGCCGTCG CCCTACGCGA CCATCTTGAG ATCGCGAAGC CATCCCGCGC GTTTCTCGCC
TTCGTGGCTG AGCGCAGCGG CTCCCGCCAC CTCCAACGTC TGCTGCGCGG CGACCGGGCA
GATCTTGACG GGTGGCTTTG GGGACGGCAG ACCGTGGATG TGATCACCGA GACCGGCACG
CACGCCTCCG CGCAGGAGTG GTGCGAGCTC CTCAAGCCCC TACGTGAACG CCGGTACTCG
ATCTCGTCGT CGGCCGAAGT CTCGCCCCGA CAGATTCGGA CGCTCGTGTC CGTGGTGCGC
TACAACGCAC CCTCGGGCGC GGCACGCACC GGTGTGGCGT CAGCATTCCT CGCCGACGCT
CCGGCCGGTA CGCCATTAAA GGTCGCCGTC GCCCGCTCCT CGGGATTCGC CCCACCGTCC
GACCCGGAGA CGCCGACGAT CATGATCGGC CCGGGCACCG GGCTGGCCCC GTTCCTCGCC
TTCCTGGACA CGCGCCGGGC CCGCGGTCAC ACCGGTCGCA ACTGGCTCTT CTTCGGCGAG
CAACGCAGGG CGACCGACTT CTACTACAGC GATGAGCTCG CCGAGTTCGC GCGGTCGGGA
CTGCTGACCC GACTCGACAC GGCCTTCTCC CGCGACCAGC GAGCCAAGAT CTACGTGCAG
GACCGTATGC GCGAGCGAGG CGCACCGCTG TGGCAATGGC TGCGCGAGGG CGCAGCAGTC
TACGTGTGCG GAGACGCGGC GCGGATGGCC AGGGACGTCG ACACCGCTCT GCGCGACGTC
GTGGCGAAGC ACGGGGACCT GAACGCCGCA CAGGCCGGCG ACTATGTCCG ACAGCTCATT
ACCGACGGAC GCTATCTTCG CGACGTCTAC TGA
 
Protein sequence
MTTKVALDRV RTVCAYCGVG CGIVLEVDRS GDSPRVVRST GDPDHPANRG RLCTKGATTA 
ELLNAPGRLT TALARAGHDE EPAPIDVDTA LDLAAARLAA LRDEHGPDAI GIYTSGQLSI
EAQYLITKLA KGFLRTQYQE CNSRLCMASA ASGYKLSLGA DAPPGSYDDF DHADVFLVIG
ANMADCHPIL FLRMMDRVKA GAKLIVVDPR RTATADKADL FLQLRPGTDI ALLNGLLHLL
HASGAVDESF VAAHTEGWAA MPDLLAGYDP ATVADITGVP EDDLRAAAAL IAAAGSWMSC
WTMGLNQSTH GTWNTNALIN LHLATGAICR TGSGPFSLTG QPNAMGGREM GYMGPGLPGQ
RSLLDPADRA HIEQLWGLPA DTIRADHGNG TIGMFEQMSA GAIKAAWIIC TNPVATVANR
RTVIEGLETA EFVLVHEAFT ETETTGYADV VLPAAVWAET DGVMVNSERN LTLTRPAADP
PGQARADWQL IAGIATRLGY ADHFTYNSAQ EVFDELRQAW NPVTGWDLRG ITHGRLRDGP
MQWPAPPGAG ARNPIRYMTD DGPRFPTPSG RAKFWPRPHV DPQEIPDEKY PYLLNTGRLQ
HQWHTLTKTG RVGRLNKLNP APFIEIHPDD AAALGISEGD QVEVSSRRGR AVLPAAVTDR
VQPGNCFAPF HWNDMFGEYL AVNATTSDAV DPVSLQPGFK ASAVALTRVH TPTPAQPPPT
LVDDVLHADA VASAAPIASL LGLGGTAAPP ALNDDESVYL AGFVAGLGLD LARGALTSVP
TLPPSAPVTA RTALWVNGAL AGLWSRLPAE GREILPGRGV SPREDPPGTG GRRVLLGWAS
QTGNAERVAE QAEARLTESG TVVTTLPLNE IDHIAPHTDL VVVTSTFGDG ESPDNGSEFL
ARLRARSMPL DDVRFSVIAL GDSSYSDFCG HGRRIDELLA ELGGLRLADR VDCEPDFDEA
ARAWLDRVLA ARAGASPTRP EPVPVPVPPV SRLPGASSGP RVSETHRVIL AGNRLLSGVG
SAKEVREFLI DLEDSDLTYE TGDVLVVRPS NRAELVDEWL STTGLRGDTI VELLQFGRTE
LAVALRDHLE IAKPSRAFLA FVAERSGSRH LQRLLRGDRA DLDGWLWGRQ TVDVITETGT
HASAQEWCEL LKPLRERRYS ISSSAEVSPR QIRTLVSVVR YNAPSGAART GVASAFLADA
PAGTPLKVAV ARSSGFAPPS DPETPTIMIG PGTGLAPFLA FLDTRRARGH TGRNWLFFGE
QRRATDFYYS DELAEFARSG LLTRLDTAFS RDQRAKIYVQ DRMRERGAPL WQWLREGAAV
YVCGDAARMA RDVDTALRDV VAKHGDLNAA QAGDYVRQLI TDGRYLRDVY