Gene Francci3_4259 details

Gene Information

Locus tag: Francci3_4259
Symbol:
ID: 3907226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5081099
End bp: 5082178
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 69%
IMG OID: 637881585
Product: DNA integrity scanning protein DisA
Protein accession: YP_483334
Protein GI: 86742934
COG category: [R] General function prediction only
COG ID: [COG1623] Predicted nucleic-acid-binding protein (contains the HHH domain)
TIGRFAM ID:
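
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5081099
end_bp = 5082178
gene_length_bp = 1080
protein_length_aa = 359

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence of n bases holds n / 3 codons.
# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches gene length
print(remainder == 0)                            # gene length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length matches codon count
```

Under these assumptions all three checks agree with the record: 5082178 - 5081099 + 1 = 1080 bp, and 1080 / 3 - 1 = 359 aa.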


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.321081
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCAGGAC CACCCGGAGA CGACATCTTC CGGGCGACAC TGGCCGCGGT CGCGCCCGGA 
ACCCCCTTTC GCGACGGCCT GGAACGCATC CTGCGCGGGC ACACCGGCGC ACTGATCGTC
CTCGGCCACG ACAAGGTCGT CGAGGGTCTG TGCACCGGCG GCTTCGAGCT CGACGTGGAG
TTCTCGGCGA CCCGGCTACG CGAGCTGGCC AAGATGGACG GCGCGATCGT GCTGTCGTCC
GACCTGCAGC GCATCGTCCG CGCGGCGGTG CATCTGGTGC CGGATCCGAC GGTGCCGACA
GAGGAGTCCG GCACGCGGCA CCGAACCGCC GAGCGGGTCG CCAAGCAGGC CGAGTTCCCC
GTCATCTCGG TCAGCCAGTC GATGCACATC ATCGCGCTGT ACGTCGCCGG GCGGCGGTAC
GTGCTGGACG GCTCGGCCGC CATCCTGTCC CGGGCGAACC AGGCCCTGGC TACCCTCGAG
CGTTACAAGC TGCGGCTAGA CGAGGTCGCG GGCACCCTGT CCGCGCTGGA GATCGAGGAC
CTCGTCACGG TCCGCGACGC GATCTCGGTG AGCCAGCGGC TGGAGATGGT GCGCCGCATA
GCCGACGAGA TCGAAGGCTA CGTCGTCGAA CTCGGCACCG ACGGCCGGCT GCTGTCCCTG
CAGCTCGAGG AGCTGATGGC CGGGGTCGAG ACCGAGCGTG AACTCACCGT CCGCGACTAT
CTGCCGATCG GGTCGAAGGC GGGGACGCCC GCGCAGGTCC TGGGTGAGCT GTCCGCGATG
TCTCCGACCG ACCTGCTCGA TCTCACCGTC CTCGCCCGGG TGATCGGATT CTCCGGCGGG
GCGGACATCC TGGACCGGCA GATCAGTCCA CGCGGCTACC GCATGCTGGC GAAGGTGCCC
CGGCTGCCAC GGATGGTGGT CGACCGGCTC GTCGACCATT TCGGCACCCT GCAGAAACTG
CTCGCCGCCG GGGTCGACGA TCTGCAGGCC GTTGACGGCG TCGGGGAGAC CCGCGCCCGA
GCGGTCCGCG AGGGCCTCTC CCGGCTCGCC GAGTCAAGCA TTCTCGAACG CTACGTATAG
 
Protein sequence
MAGPPGDDIF RATLAAVAPG TPFRDGLERI LRGHTGALIV LGHDKVVEGL CTGGFELDVE 
FSATRLRELA KMDGAIVLSS DLQRIVRAAV HLVPDPTVPT EESGTRHRTA ERVAKQAEFP
VISVSQSMHI IALYVAGRRY VLDGSAAILS RANQALATLE RYKLRLDEVA GTLSALEIED
LVTVRDAISV SQRLEMVRRI ADEIEGYVVE LGTDGRLLSL QLEELMAGVE TERELTVRDY
LPIGSKAGTP AQVLGELSAM SPTDLLDLTV LARVIGFSGG ADILDRQISP RGYRMLAKVP
RLPRMVVDRL VDHFGTLQKL LAAGVDDLQA VDGVGETRAR AVREGLSRLA ESSILERYV
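
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they could be compared, the Python below strips the spacing, computes a GC fraction, and translates codons using the standard codon assignments, which NCBI translation table 11 shares with table 1 (table 11 differs only in which codons may serve as start codons). The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this illustration, not part of any database tool.

```python
# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses the same assignments as table 1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_block = (
    "ATGGCAGGAC CACCCGGAGA CGACATCTTC CGGGCGACAC TGGCCGCGGT CGCGCCCGGA"
)
print(translate(first_block))  # MAGPPGDDIFRATLAAVAPG
```

The translated fragment matches the first two blocks of the protein sequence (MAGPPGDDIF RATLAAVAPG). Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content of 69%.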