Gene Francci3_3918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3918 
Symbol 
ID3906877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4685662 
End bp4687209 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content68% 
IMG OID637881245 
ProductNADH dehydrogenase 
Protein accessionYP_482997 
Protein GI86742597 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.940984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTACG GGCAACCACC GCACATCCTC ATCGTCGGCG GAGGATACGT CGGCATGTAC 
ACGGCCCTGC GGCTGCGGCG GAAGCTGCGC CAGGACGAGG CATTGGTCAC AGTCGTCGAG
CCCAACTCGT ACATGACCTA TCAGCCTTTC CTCCCGGAAG CGGCGGCCGG CAACCTTGAG
CCACGGCACG TGGTGGTGCC GCTACGTAAG GTGCTGAAGG GCTGTCGGGT TGTCAGCGGA
AGCGCGCTTC AGGTGTCGCA TGGGACACGG ACCGCCGTGA TCAAACCATC CCTTGGCGAG
AAATTTGATC TTAAGTACGA CATCTTGGTG ATGTGTCCGG GATCGGTGGC GCGAACCCTG
CCAATCCCAG GGCTCGCGGA GCAGGGCATC GGCTTCAAGA GCGCGGCCGA GGCCATTTAT
CTCCGTAATC AGGTCATCAG CCGGTTGGAC GCGGCCGCCT CGGTGACCGA TCCCGCGGTC
CGGCGCCGGG CGCTGACCTT TCTCTTCATC GGCGGAGGGT ATGCCGGAAT AGAGGCTCTT
GCCGAGTTGG AGGACATGGC CCGCGATGCG TGTTCTTTCT ATCCTGATCT GAAACCGACG
GATATGCGTT GGGTCCTCGT TGAAGCCGCC GGCCGTATTC TTCCCGAGGT TTCACCCGGA
ATGGGGCTTT ATACCCTCCG GCAGCTCGAG CACCGGGGCA TCGACGTCAG GTTGAACACG
CGGGTGGAGA GCCTGGTCGG TGGGCGGGTT GTGCTGAATA ACGGTGAGGA GTTCGACGCG
GGCACCATCG TGTGGACGGC CGGGGTGCGG GCGAACCCGA TGCTGGCCGA CACGGATCTG
CCCTTGGACG ATCAAGGCCG GGTGCGCGCG ACCGTCTTCC TGCAGATCGA CGGGGTGGGT
GACGCGTGGG CCGCGGGTGA CTGCGCGGCC GTGCCCGACC TGACCAGGGG CGAGGATGTC
ACGACCGGTC CCTCGGCCCA GCACGCCGTC CGCCAGGCTC GCCGGCTGGC CCTCAACATC
CTCGCCGAGC TGCGCGGCGA GCCCCTCGAA CCATACGAGC ACAGCTATGC CGGCAGCGTG
GCGTCCCTGG GCCTGCACAA GGGTGTCGCC GAGGTCTACG GGGTCAAGCT GCGCGGCTGG
CCCGCCTGGT TCATGCACCG GACGTACCAC CTGAGCAGGG TTCCCACCCT CAACCGCAAG
ACCAGGGTGG TCGCGGACTG GTCGTTGGCA CTGTTCTTCC GCCGCGAGAT CGTCTCGCTC
GGATCCTTCG CCGATCCCCG GGCCGAGTTC CGCCGGGCGG CGATGCCGTC CGCGTTCGCC
GCGGCCATCT CCCCAGGCAC GCCCGGCACT CCGGTCACGA CGACCGGGCG GAACGGGACC
ACTCGCGCCC CCGCGTCCGC CGTCGCAGCC GACTCGCCCG GCGATGCCGA GGGCGGTGAG
GCCGAGGGTG GTGAGGCCGC GTCCCCGGGT GAGGATGTCG TCCGGGCCGC ACCGACCGGC
AGGATCTCCC GCGGCCGGAG CCGGACGCCC GGTCCGCGCG GCACGTGA
 
Protein sequence
MGYGQPPHIL IVGGGYVGMY TALRLRRKLR QDEALVTVVE PNSYMTYQPF LPEAAAGNLE 
PRHVVVPLRK VLKGCRVVSG SALQVSHGTR TAVIKPSLGE KFDLKYDILV MCPGSVARTL
PIPGLAEQGI GFKSAAEAIY LRNQVISRLD AAASVTDPAV RRRALTFLFI GGGYAGIEAL
AELEDMARDA CSFYPDLKPT DMRWVLVEAA GRILPEVSPG MGLYTLRQLE HRGIDVRLNT
RVESLVGGRV VLNNGEEFDA GTIVWTAGVR ANPMLADTDL PLDDQGRVRA TVFLQIDGVG
DAWAAGDCAA VPDLTRGEDV TTGPSAQHAV RQARRLALNI LAELRGEPLE PYEHSYAGSV
ASLGLHKGVA EVYGVKLRGW PAWFMHRTYH LSRVPTLNRK TRVVADWSLA LFFRREIVSL
GSFADPRAEF RRAAMPSAFA AAISPGTPGT PVTTTGRNGT TRAPASAVAA DSPGDAEGGE
AEGGEAASPG EDVVRAAPTG RISRGRSRTP GPRGT