Gene Francci3_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3520 
Symbol 
ID3905254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4204803 
End bp4205693 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content65% 
IMG OID637880842 
Productextracellular solute-binding protein 
Protein accessionYP_482602 
Protein GI86742202 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0548658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.231474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGCG AACGCCGCCG CAGGCGCGGC GGCGCGCTCC TCGCCGGCGC GACGCTGCTC 
GCCGCCCTCG GACTGAGTGC CTGTGGGAGC AGCGGCGATG ACACCTCGAC ACCCACCCCC
TCGGTGACCT TCACCGCCGG CACCACCATG GCCCGGCTGC ACGACGCCGG AAAGATCATC
ATCGGGACGA AGTTCGACCA GAACCTGTTC GGGCTGAAGA ACCTCAGCGG CCAGCCCGAG
GGCTTCGACG TCGAGATCTC GAAGATCGTC ACGGATGCCC TGGGCATCCC GCGCGACAAG
GTCTCCTACG TGGAGACAGT GTCGGCCAAC CGTGAACCCT TCATCCAGCA GGGCCGGGTC
GACCTCGTGG TGGCCACCTA CACAATCAAC GACAAACGAA AGAAGGTCGT CGACTTCGCG
GGGCCGTACT ACGTCTCCGG CCAGTCCATC ATGGTCTCGA AGAACAACAC CGACATCACC
GGCAAGGACA CCCTTGCCGG CAAGAAGGTG TGCTCGGTCA GCGGATCCAC GCCGGCCGAG
AACATCCGCC GGGTGGCGCC CACCGCCCAG CTCGTGCTGT TCGATGTGTA CAGCAAGTGC
GCCGACGCGT TGAAGAACGG TCAGGTCGAC GCGGTCACCA CGGACAAGGG CATCCTGCTG
GGTCTCGTCG ACAAGGACCC TGACGCCTTC AAGGTGGTGG GTGGCACCTT CACGAAGGAG
CCCTACGGCA TCGGCCTCAA GAAGGGTGAC GACGCGTTCC GGAACTTCAT CAACGACACG
CTCGAGGCCG CTTACAAGGA CGGGCGCTGG GAGAAGGCGT ACACCTCGAC CCTCGGCAAG
GTGGAGCCGA CCGTGCCCAC TCCCCCGGCC GTCGACCGGT ACACGTCCTG A
 
Protein sequence
MSRERRRRRG GALLAGATLL AALGLSACGS SGDDTSTPTP SVTFTAGTTM ARLHDAGKII 
IGTKFDQNLF GLKNLSGQPE GFDVEISKIV TDALGIPRDK VSYVETVSAN REPFIQQGRV
DLVVATYTIN DKRKKVVDFA GPYYVSGQSI MVSKNNTDIT GKDTLAGKKV CSVSGSTPAE
NIRRVAPTAQ LVLFDVYSKC ADALKNGQVD AVTTDKGILL GLVDKDPDAF KVVGGTFTKE
PYGIGLKKGD DAFRNFINDT LEAAYKDGRW EKAYTSTLGK VEPTVPTPPA VDRYTS