Gene Francci3_0937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0937 
Symbol 
ID3906101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1100991 
End bp1102472 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content75% 
IMG OID637878271 
Productpyridine nucleotide-disulphide oxidoreductase dimerisation region 
Protein accessionYP_480050 
Protein GI86739650 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0486491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.876229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCCG ACGTGGACGT GATCGTGCTC GGCGCCGGGC CCGCGGGCGA GAACGTCGCC 
GGCCGCTGTG CCGACGCTGG TCTGTCCGTC ATCGTCGTCG AACGCGAGCT CGTCGGCGGG
GAGTGCAGCT ACTGGGGCTG TATCCCGAGC AAAACCCTCC TGCGGCCGGG GGAGGTGCTG
GCGGCGGCCC GCCGCGTTCC CGGCGCGGCG TCGGCGGTCA CCGGACCCGT CGACGCCGCC
GCGGCGTTCG CCCGGCGAGA CTGGATGGTG GGCGACTACT CCGATTCCTC CCAGGTGCCC
TGGCTGGTCG ACAAGGGCGT CGAGTTGGTG CGCGGGACCG GGCGCCTGGC GGGGCCCGCC
GGCCGGGTCG CGGTGACGCT GCTGGACGGC ACCCGCCGGG TGCTGACCGC GTCCCGAGCC
GTGGTCGTGG CCACCGGGAC GCGGGCCACT GTTCCACCGA TCCCCGGCCT CGCCGACGCC
GAACCGTGGG ACAACCGGAC CGCGACCGGC GCCTGGAAGG TCCCGCACCG CCTCGTGGTG
CTCGGCGGCG GCGCCGTGGG GGTGGAACTG GCCCAGGCCT TCCGCCGGCT CGGTAGCGCG
GAGGTGACCT TGATCGAGGG GTCACCCCGG CTGCTCGTCC GGGAGGAGGA GTTCGTCGGC
GAGCAGTTGC GGGCCGCCTT CGAGGCGGCC GGCATCACGG TGCGGCTCGG AAGCCGGGCG
GTCGAGGTCC GCCGGGCCCG GACGGGACCG GATGGCCCGG CCGGGCCGGT GACCGTCACG
GTGGAGCCGG TGGATCCGGC GGATCCGGCG GATCCGGGTG AGCCGGTTGA GCGGCGGCCG
CGCCGCGATG CCGTGGTCGC CGACGAGATC CTGGTCGCCG TCGGGCGCAC CCCGGCGACC
GGCGACCTGG GCCTGGAGAC CGTGGGCCTG CAACCCGGGC GGTTCATCGA GGTCGACGAC
CGGCTCCGCG CGGTCGGCGT GGCGGGGGAC TGGCTCTACG CGGTCGGCGA CGTCTGCGGC
CGCGCGCTGC TCACCCACAT GGGCAAGTAC CAGGCCCGGC TGGCGGCCGA CGTCATCACC
GGCCGCCGGG ACCTCGCCGG CGCTCCCGTG CGCGACCTGG CCGACCCGGC GGCCGTCCCG
CGGGTGACCT TCACCGATCC GCAGGTCAGC GCGGTCGGAC TGACCGAGCG GGCCGCGCGG
GAGGCCGGGA TCGCCGTGCG GGCCGTGAGC GTGCCGACCG GATCGGTGGC CGGCGCGTCG
GTGCGGGGGG AGGAGATCAC GGGCACGTCG AAGATTGTCG TGGACGAGGC CCGCCGGGTG
CTCGTGGGAG CGACCTTCAC CGGTCCGGAC ACCCAGGAGA TGGTGCACGC GGCCACGATC
GCGATCGTCG GAGAGGTGCC GTTGGAGCGG CTGTGGCACG CGGTCCCGTC GTTCCCGACA
GTCAGCGAGG TCTGGCTGCG ACTGTTGGAG GTATACGGAT GA
 
Protein sequence
MDADVDVIVL GAGPAGENVA GRCADAGLSV IVVERELVGG ECSYWGCIPS KTLLRPGEVL 
AAARRVPGAA SAVTGPVDAA AAFARRDWMV GDYSDSSQVP WLVDKGVELV RGTGRLAGPA
GRVAVTLLDG TRRVLTASRA VVVATGTRAT VPPIPGLADA EPWDNRTATG AWKVPHRLVV
LGGGAVGVEL AQAFRRLGSA EVTLIEGSPR LLVREEEFVG EQLRAAFEAA GITVRLGSRA
VEVRRARTGP DGPAGPVTVT VEPVDPADPA DPGEPVERRP RRDAVVADEI LVAVGRTPAT
GDLGLETVGL QPGRFIEVDD RLRAVGVAGD WLYAVGDVCG RALLTHMGKY QARLAADVIT
GRRDLAGAPV RDLADPAAVP RVTFTDPQVS AVGLTERAAR EAGIAVRAVS VPTGSVAGAS
VRGEEITGTS KIVVDEARRV LVGATFTGPD TQEMVHAATI AIVGEVPLER LWHAVPSFPT
VSEVWLRLLE VYG