Gene Francci3_2249 details


Gene Information

Locus tag: Francci3_2249
Symbol:
ID: 3905017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2625361
End bp: 2626563
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 73%
IMG OID: 637879580
Product: hypothetical protein
Protein accession: YP_481346
Protein GI: 86740946
COG category:
COG ID:
TIGRFAM ID:
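
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TGA, a stop codon in translation table 11). The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information fields above.
length = gene_length_bp(2625361, 2626563)
print(length)                     # 1203
print(protein_length_aa(length))  # 400
```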


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.178896
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0169259
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCGCT CCATGGGCCG GTCCGTGGAG CGGTCCGTGG GCCGGCTCGT CGACTCCACC 
CCGCCGTCCC GGGCGCGCGA CGTCGACTTC CTGCGCCTGG CGAGCGTCTG TGTCGTGGTG
CTGTGGCACT GGGCCCTGTC CATCGACCAC TTTCGCGGGG GCGTCCTCGT GATGCCCAAT
CCGATCGCGC GGATACCGTT CGCCTGGCTC GCGACCTGGC TGCTCCAGGT GATGCCGGTG
TTCTTCGTCA TCGGTGGCTT CGCCCACCTG GCGGCCTGGG ACGCGGCCGG TCGCGCCACG
GGGTCCCCGC GCGATGATCC CGCGCGTCCC TGGGGCCCGC GGGCACGGCG CTTCCTGCGG
GGGCGGCTGC GCCGCCTGTT GCCCCCGATG GCGGTGTTCG CCGTCGTCTG GGCGGCGGTC
GACGCCGTCC TGCTGCTCGC GGTCCCCGAC TACCCGGGTC TGCTGCGCTA TGGACGGGCC
GTGCTGGTGC CGCTGTGGTT TCTCGCGGCG TACCTGGGGG TCATCCTGGT CGTGCCGGTG
ACCGCGGCGG TCCATCGGCG GTTCGGCCGC CGGTTCATCC TGCTGTTGGG CGCGGTCGTG
GCGCTCGTCG ACCTGGCGCG GTTCGGCACC GGCAGCACGG TGTTCGGCTA TGTCAACACC
GGTCTCGTCT GGGTTTTCGC CCACCAGCTC GGGTATTTCT GGCGGGACGG CGTCCTGCGC
GGGCCGCGGC GGGCGCTGCT GACGGCGCTC TGCGGGCTGG CGGGGCTGGC CCTGGTGACG
ACGCTCGACG AGTATCCGCG ATCGATGGTC GCCACCGAGG GAGCCAGACG CGGCAACATG
TTCCCGACGA CCGCTGCGAT CGCCGTGCTC GCCGTCTTCC AGCTCGGTCT GATCCTGCTC
GCCGCACCCG CCCTGAACCG GATGCTGGCC CGGCGCCGGC CGTGGACGGC GGTCGTCACG
GGCAACGCCG TGATCATGAC CGTGTTCCTG TGGCACATGA CGGCCCTGCT GCTCGCGATG
GTCACGATGC GGGCGGTCGG GCTGCCGATG CCCGACGAAC CGACCGCGAC CTGGTGGGCC
GGGCGACCGC TGTGGGTGAT CCTGCCCGCG CTCTTCCTGG CGCCGTTGAT CGTCCTGTTC
GCGCCGGTGG AGCGCGGGGC CGCGGCCCCG CGTGGACCGG CGCGCACCGG GCCGGACGAC
TGA
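
The GC content reported in the Gene Information section can be recomputed from a nucleotide string. The sketch below is one simple way to do it, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; `gc_percent` is an illustrative helper, not part of the record. Only the first line of the gene sequence is used as sample input, so the result describes that 60-base window rather than the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases), used as sample input.
first_line = (
    "ATGGGCCGCT CCATGGGCCG GTCCGTGGAG CGGTCCGTGG GCCGGCTCGT CGACTCCACC"
)
print(f"{gc_percent(first_line):.1f}%")
```

Passing the full gene sequence instead of `first_line` gives the figure for the whole CDS, which can then be compared with the GC content listed in the record.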
 
Protein sequence
MGRSMGRSVE RSVGRLVDST PPSRARDVDF LRLASVCVVV LWHWALSIDH FRGGVLVMPN 
PIARIPFAWL ATWLLQVMPV FFVIGGFAHL AAWDAAGRAT GSPRDDPARP WGPRARRFLR
GRLRRLLPPM AVFAVVWAAV DAVLLLAVPD YPGLLRYGRA VLVPLWFLAA YLGVILVVPV
TAAVHRRFGR RFILLLGAVV ALVDLARFGT GSTVFGYVNT GLVWVFAHQL GYFWRDGVLR
GPRRALLTAL CGLAGLALVT TLDEYPRSMV ATEGARRGNM FPTTAAIAVL AVFQLGLILL
AAPALNRMLA RRRPWTAVVT GNAVIMTVFL WHMTALLLAM VTMRAVGLPM PDEPTATWWA
GRPLWVILPA LFLAPLIVLF APVERGAAAP RGPARTGPDD