Gene Francci3_2538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2538 
Symbol 
ID3904682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3001759 
End bp3003063 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content72% 
IMG OID637879865 
Productputative DNA-binding protein 
Protein accessionYP_481631 
Protein GI86741231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGGGT GGCTGGGCCT GACACAGGCG CAGCTCAGCC GGATAGAGAA CGGCCGCGCC 
CCCGAAGAGC TGACCAAGCT GATTCGCTAC AGCCAAATCC TCGGCATCCC GGGGGAACTG
CTCTGGTTCG ACCGTCCCGG GGAGCCGCGC ACCCGGGCCA CCGGAGGGAG CCAGGCGCCG
GCGCTCACCG TGCCGGTCAT CGTCGGCGGG CAACAGGTGG CGCTTCCCAT TGACCGCGCC
GCCGCGCACT CCCACGGCCT CACTGATCTG GTCGCGGAGC TGGCTTCGGA CTCTGGCGAG
GCGGGCGCGC CCCAGGTGCC GATGCCCCGG GGGGAGCGGG CCGCGCCCGG GTCGGCCCAC
GTCGTCCCAC TGGCGAGCCT AGAGGAAATG CAGCACTTGG CCGCCGCGAT GCAGGACGCG
GGCCGCTACC TGGATGAGAC GGTGGTGGGC TACTTCGGCC AGCAGTTCAC CCGGTGCAAG
GCCAACGACG GCCAGATGGG GCCGCTCCGG GCGCTGCCCC TGGTGTTGGG CGTCCTCGAT
GCCATCCAGT CCCGCGCCCG GGATGTCCGC CCCCAGGTGC GGCGCGCTCT CCTCAGCGTC
GGGGCAGACG GGGCAGAGTT CGCGGGCTGG CTCTACCGCG ACTTGCACGA CGCTACGTCG
GCGGCCTACT GGTACGACCG GGCGATGGAG TGGGCGCAGG AGGCCGGGGA CCTGCCAATG
CAGGGCTACA TCCTGCTTCG TAAGTCCCAG TCCGCCTATG AGGACCGTGA CGCGACGCGG
GTGCTTACCC TCGCCCAGGC GGCGCGCTAC GGGCCGTGGA ACCTGCCGCC CCGGGTTCAG
GCAGAGGTCA CGCAGCAGGA GGCACGCGGT CTGGCGATGA CCGGCGACCC CATCGGCGTC
GTGGAGCAGA AGCTTGACGA AGCTCGCGCG CTGCTGGGGG CGGCCGACGA CGACCCGGAT
TCCCTGGGGG CCGGGTACGA CGAAGGAACC TGGTTGTTGC GGTCGGCCAC CTGCTACATC
GAGGCGGGAA AGCCGGGCCG GGCCGCGGCC CTCTACGGCG AAGTCCTGGC AACCGGCGCG
CTGTCCCGCC GGGACGAGGG CTACTACCGG GCACGCCGGG CTGTCGCTCA TGCGTTAGGC
GGCGAGCCTG ACGCCGCCGC CGAGGAAGGC CTGACCGCCC TGCGGCTGGC GACGGCCACC
GGCTCCAGCC GCACCACCCG GGAGTTGACG CGGGCCGCCC AGATATTGAC GCCCTGGCAG
ACCCGGCCCG GTCCCCGCCA GCTGCGCGCC GCCCTGCTGG TCTAA
 
Protein sequence
MGGWLGLTQA QLSRIENGRA PEELTKLIRY SQILGIPGEL LWFDRPGEPR TRATGGSQAP 
ALTVPVIVGG QQVALPIDRA AAHSHGLTDL VAELASDSGE AGAPQVPMPR GERAAPGSAH
VVPLASLEEM QHLAAAMQDA GRYLDETVVG YFGQQFTRCK ANDGQMGPLR ALPLVLGVLD
AIQSRARDVR PQVRRALLSV GADGAEFAGW LYRDLHDATS AAYWYDRAME WAQEAGDLPM
QGYILLRKSQ SAYEDRDATR VLTLAQAARY GPWNLPPRVQ AEVTQQEARG LAMTGDPIGV
VEQKLDEARA LLGAADDDPD SLGAGYDEGT WLLRSATCYI EAGKPGRAAA LYGEVLATGA
LSRRDEGYYR ARRAVAHALG GEPDAAAEEG LTALRLATAT GSSRTTRELT RAAQILTPWQ
TRPGPRQLRA ALLV