Gene Francci3_2587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2587 
Symbol 
ID3906493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3050754 
End bp3052286 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content77% 
IMG OID637879912 
Productrespiratory-chain NADH dehydrogenase domain-containing protein 
Protein accessionYP_481678 
Protein GI86741278 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.278958 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCC GCGCTGCCAC CGGAGCGGCC CAACCGTCCA TCCCCACCGG CACCAGCATC 
GCCACCGGCA CCAGCGACGT CGGCCGCGCC GGCTCCCCTG AGGCCCAGCC CGGGTGGGGG
CGCGAGCCCT GGCCCGCGGC CATGCACGCG GTACGGCCCG CGGCCGGCGC GGGCGGCCTC
CCCGCGCATC ACGTCCCGGC CGGACGGCTG CTGACCGCAG CCGCGTCCGA CCTCGCCGCG
CACGACCGAC AGTGTGGACC GCTGCCCTGG CGCGGCGGAC CCGGGCGGTT ACTTCCGGAG
ATCCACGACT CCGGGCTGAC CGGGCGGGGC GGCGCGGCCT TCCCCACCTG GCGGAAACTC
GCCGCGAGCG CCGAGGGCAC CTGCCTCGAC GGCAGCCACT CCGGCAGCGC GCACCGGGGC
AGCACGCACC GGGGCAGCCA GCACCGGAAC GCCGGGCACC GAGCCGACCG GTACCGCAGC
AGCGCGCACC CCGTGGTCGT CGCCAACGCC GCCGAAGGGG AACCCGAGAG CGCCAAGGAC
GTCACCCTGC TCACCGTGGC GCCCCACCTC GTCCTCGACG GCCTGCAGCT CGCCGCGGAG
GCGGTCGGGG CCGATGACGC CTTCGTCTAT CTCAAACCCG GTCCGGCGGT CACGGCGGTC
CGGCGGGCGC TGGCCCAACG GCGGGCCGCG GGCTGGGACC GGTTCACCGT CCAGATCCGG
GAGGCGCCGG AGACCTTCGT CGCCGGGGAG GCATCGGCCG TCATCGCAGC GCTGGAGGGA
GGGGCGGCCC GGCCGCGCGC GCACTGGCAA CCGCTCGCCG AGGCCGGTTT CCACGGCCGT
CCGACCCTGG TGCAGAACGC CGAGACACTC GCGCACCTCG CGTTGATCGC CCGGTGGGGA
GCCAGCTGGT TCCGCTCGGT CGGGACCGCC GAGGAACCGG GCACGTTCCT GGCCACCGTG
ACGGGAGCGG TCGCCGCGCC CGGTGTCGTC GAGGTGCCGT TCGGCACCCC GCTCGGCACT
CTCGCGCAGC TCGCGGGCGG CTTCACCGAG CAGGTCGGGG CCTTCCGGGT CGGCGGTTAC
AGCGGTGCGT GGCTGCCCGG CGGCCCGGGA GCGACGATCG CGATGTCCCG GGCGGCGCTG
GCGCCGTGGG GTGCCGCACC GGGCACCGGA GTGGTCGCCG TCCTCCCGGC CCGGGGCTGT
GGGCTCGTCG AGACCGCGCG CATCGTCGGG TACCTGGCCG CGCAGAACGC TGGCCAGTGC
GGGCCATGCG TCAGCGGGCT GCCCCAGCTT GCGGACGCCG TGGCCGGGAT GGCCCGAACG
GATGGCGGGT CCGGCGGCCC GGTGCAGGGG GCTGGCGATC CGGGACAGTC CGCCATCCGG
GCGCTGCGCC TCGCCGCCCT GGTCGCCGGT CGCGGCGCGT GCCACCACCC GGACGGCGCG
GCCCGCCTCG TGCACAGTGC GCTGCGCACG TTCGTCGACG ATATCCGGGC GCACGCCGAG
GGCCGCTGCC TCGGCTCGGC GTGCGCATCC TGA
 
Protein sequence
MTIRAATGAA QPSIPTGTSI ATGTSDVGRA GSPEAQPGWG REPWPAAMHA VRPAAGAGGL 
PAHHVPAGRL LTAAASDLAA HDRQCGPLPW RGGPGRLLPE IHDSGLTGRG GAAFPTWRKL
AASAEGTCLD GSHSGSAHRG STHRGSQHRN AGHRADRYRS SAHPVVVANA AEGEPESAKD
VTLLTVAPHL VLDGLQLAAE AVGADDAFVY LKPGPAVTAV RRALAQRRAA GWDRFTVQIR
EAPETFVAGE ASAVIAALEG GAARPRAHWQ PLAEAGFHGR PTLVQNAETL AHLALIARWG
ASWFRSVGTA EEPGTFLATV TGAVAAPGVV EVPFGTPLGT LAQLAGGFTE QVGAFRVGGY
SGAWLPGGPG ATIAMSRAAL APWGAAPGTG VVAVLPARGC GLVETARIVG YLAAQNAGQC
GPCVSGLPQL ADAVAGMART DGGSGGPVQG AGDPGQSAIR ALRLAALVAG RGACHHPDGA
ARLVHSALRT FVDDIRAHAE GRCLGSACAS