Gene Francci3_2846 details


Gene Information

Locus tag: Francci3_2846
Symbol:
ID: 3904758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3357356
End bp: 3358633
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 73%
IMG OID: 637880167
Product: hypothetical protein
Protein accession: YP_481933
Protein GI: 86741533
COG category:
COG ID:
TIGRFAM ID:
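
The start and end coordinates, gene length, and protein length listed in this record agree with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3357356, End bp 3358633.
length = gene_length_bp(3357356, 3358633)
print(length, protein_length_aa(length))  # prints: 1278 425
```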


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.542559
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.824036
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGACT CATCCGCTCT TGATGTCCTG ATCGGGCTCG CGTTGCTCTT TGCCGCGTTC 
AGCCTCGCAG TCTCCCGGAT CAACGAGGCC GTGCTCGCTC TGTTCCGCTA CCGGGGGCGC
CAGCTGGAGG CCGAGCTGCG ACGGCTGCTC GGCGGTGACC GGCCCGCCGA ACCGTCCGAT
CCGGGGGCGG ACCCGCCCGC GGATCGGACG GACCTCGTCG CCGAACTGTT AGACGGGCCG
CTTCGGGCCA TGCGTGCCAC TGGCCGCGAC ACCTCCCTGC CGGCCATGGA CGACCATCCG
CCGGTTTCCG GTACCTGGGC CTCGGTGCGC CGCGCGCACA CACTGCGGTT GCCGTCCTAC
CTGCCGTCCA CCGCCTTCGC TCGGGCGCTG CTGGACCGGG TCGACCCGCC CGCCCGGGCC
CTGCTGTCCC AGCTCCGCCC GGACACCCTG CCTGACCACG TCCCCGACGA GGCCAGGGCC
GCGTACCGGC GGGCCTACGA CGGTGCCCGG CATGCCCTCG GGGAGAGGTC CGCCCAGGCC
CTGTACGACG CGATGCCTGT GGATCACCCC ACCGGGCGGG TCGTGGCCGC CGCGCTCGTC
GCCGCCACCA GGGCCGGAGC CGTGGGAACG ATGGAGGACG GGCTCGCCGC GATGCCCCCC
TCCCCGGCGA AGACCGCGAT GACCACAGCG ATAGTCCAGG CCGGCGGGGA CCGCGAGAAG
GTCGTCATCG AGCTGGCCCG GTGGTACGAC GACGCGATGG ATCGCCTGTC CGGGTGGTAC
AAGCGGCGCA TCGCCGTCTT CCTGCTCGGC TACGCGGTCC TGCTGTCCGT CCTGTTCAAC
CTCGACGCGA TCGGGCTCGC ACGGGCGTTC TGGCAGGACG GCACCGTGCG GCAGGCGGCC
GTGACCGCGG CCCAGGCCGA GGTGGGCTCA TCCGGCGAGG CGGCCGGGAG CTCCGCCGGG
AGTTTCGCCG GGGATCCCGC CTCCGGCACC CGCCTCACCT CCGACCAGCC CATGGTGGGG
GACGCCACGG AACAGGTGAT CAAGACGGTG CGCGAGGCCT CGGGGCTCGC CTTCCCGATC
GGCTGGGTCC ACAACTCCGC GGGCCGCGAC GACCCACGGG AGGTTCCCGA CTCCGTCGAG
GGGTGGCTCC TGAAAATCGC CGGCATCGCG ATCGCCTGCT TCGCGCTCAC CGCCGGCGCC
CCCTTCTGGT TCGACCTGCT CGGCAGACTG GTGAACATGC GCGCCACCGG CCCCAAACCC
CGGGCCGCCA ACGGATAG
 
Protein sequence
MPDSSALDVL IGLALLFAAF SLAVSRINEA VLALFRYRGR QLEAELRRLL GGDRPAEPSD 
PGADPPADRT DLVAELLDGP LRAMRATGRD TSLPAMDDHP PVSGTWASVR RAHTLRLPSY
LPSTAFARAL LDRVDPPARA LLSQLRPDTL PDHVPDEARA AYRRAYDGAR HALGERSAQA
LYDAMPVDHP TGRVVAAALV AATRAGAVGT MEDGLAAMPP SPAKTAMTTA IVQAGGDREK
VVIELARWYD DAMDRLSGWY KRRIAVFLLG YAVLLSVLFN LDAIGLARAF WQDGTVRQAA
VTAAQAEVGS SGEAAGSSAG SFAGDPASGT RLTSDQPMVG DATEQVIKTV REASGLAFPI
GWVHNSAGRD DPREVPDSVE GWLLKIAGIA IACFALTAGA PFWFDLLGRL VNMRATGPKP
RAANG
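
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a minimal sketch such as the one below. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two tables differ only in which codons may act as starts). The helper names `clean`, `translate`, and `gc_content` are hypothetical and not part of any database tooling, and the code expects unambiguous A/C/G/T input.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing ambiguous bases such as N.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCCGGACT CATCCGCTCT TGATGTCCTG"))  # prints: MPDSSALDVL
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison against the protein sequence and GC content listed in this record.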