Gene Francci3_3973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3973 
Symbol 
ID3906933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4754237 
End bp4755577 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content73% 
IMG OID637881301 
ProductFHA domain-containing protein 
Protein accessionYP_483052 
Protein GI86742652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.825925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGAC TGGACCGTTT CCGGGTGGTC GTGGAATCCG GGCGCGGAGA TCGGACGGGA 
GTGGTCGTCC GGCTGCCCGG CGCACTCATC GTCGCCTGTG CCGGCCGGGC GGAGACGGCC
GAGACGACCA CCAGGCTGCT GGCGCTGTGC GCCGAGGTCG CTGCCGAGGT GGGCGCCACC
GCGTCGACCA TGGGTCGCCG GCTGGTGCGT CGGGTCGCGG GGCTGCTGGC CGATGCCGAT
CCGGATCGGG TGCCCGACTT CAGCCTGCTG ACCACGGTCA ATGACCGGGT CGCCGCGCTG
GTCCACGGGG CGATGGACGT CGTCGCGACG GGCAGCTGCG GAGTGACGCT CTCCGGCGTC
GACTCGGCCA CGTGGGTCGA CCGGCTACTG CCCACCGAGA TCAGCCGGAT CGACGTCGGC
CCCACCGGCC TGGTCGGGCC GACGGGTTTC CCCGGCGGGC TCGGCGATCT CGGTTTCCCG
CTTGACCTGC GCATCGGGGC CGTGCCCGGG ATCGGGGTGA GTCTGCTGCT CAGCGACACG
CCGTCGCTGC CCGCGCCGAA GGCCTCCGCC GAGCAGCTGC TCGCCGGATT CGATCCGGTC
CGGGAGCCGA TGACCGGCGC CGCGCCCATC CCCACGCCGG GAACCCGCGC CCCCGATCCG
ATGCCGACCA CCCCGCCGGC GCTCGCCCCC CTGCTCAGCA AGGAGGAGGA GGCCCATCGA
CGCCGCGCCG CCGCCGAACC GACGCAGGCG GCCGACCTCG ACGAGCTCGA CGAGCTCGAC
GCCCTGACCC AGCTCCCGGG CCAGAGCTTC ACCGTCTCCG ACCTCATCGA GGACGACGAG
GCACCGACGA TGCTGCCGAG CAGCGGCGAG CCTCAGGTCG AGGGTGTGCT GTGCGCCAAC
GGCCACTTCA ACCACCCGCA GGCGCCGTAC TGCTCCGAGT GCGGCCTGTC GCTCGCCCAG
CAGAACACCC GCACGGTCTG GGGTCCCCGG CCGCCCGTCG GCGTCCTCGT CTTCGACGAC
GGCCAGACCA TGAACGTCGA CATGGACCTG GTGATCGGCC GCCAGCCGGA CCGCGACGAT
GCGGTCCGGG CCGGGAAGGC ACGGGCGCTG CCGGTCGAGG ACGGTGAGAG CGCCGTCTCC
CGGGTGCATG CCGTCATCAC CCTCAACGGT TGGGACGCGG TCATCACCGA CCAGGGTTCG
GCGAACGGCA CCTACATCGC CCCGCCGGAG GCGACCGTGT GGACGCCGCT GAGCCCGCAC
CAGCCGGCTC CCCTGATCCC CGGCACCCGC GTGCAGGTGG GCAAGCGGAC GTTCGTCTTC
AACTCCCACC TGCACGTTTG A
 
Protein sequence
MAGLDRFRVV VESGRGDRTG VVVRLPGALI VACAGRAETA ETTTRLLALC AEVAAEVGAT 
ASTMGRRLVR RVAGLLADAD PDRVPDFSLL TTVNDRVAAL VHGAMDVVAT GSCGVTLSGV
DSATWVDRLL PTEISRIDVG PTGLVGPTGF PGGLGDLGFP LDLRIGAVPG IGVSLLLSDT
PSLPAPKASA EQLLAGFDPV REPMTGAAPI PTPGTRAPDP MPTTPPALAP LLSKEEEAHR
RRAAAEPTQA ADLDELDELD ALTQLPGQSF TVSDLIEDDE APTMLPSSGE PQVEGVLCAN
GHFNHPQAPY CSECGLSLAQ QNTRTVWGPR PPVGVLVFDD GQTMNVDMDL VIGRQPDRDD
AVRAGKARAL PVEDGESAVS RVHAVITLNG WDAVITDQGS ANGTYIAPPE ATVWTPLSPH
QPAPLIPGTR VQVGKRTFVF NSHLHV