Gene Francci3_2423 details


Gene Information

Locus tag: Francci3_2423
Symbol:
ID: 3906406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2814377
End bp: 2815408
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 74%
IMG OID: 637879753
Product: extracellular solute-binding protein
Protein accession: YP_481519
Protein GI: 86741119
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID:
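
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the listed numbers but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2814377
END_BP = 2815408
GENE_LENGTH_BP = 1032
PROTEIN_LENGTH_AA = 343


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks compare a derived value with the value listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```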


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0450755
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.051876
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGATCA TCCGGGTCGC GACGGCGCGA GCGGTGACGG TACTGATTAC CGTCGCCCTG 
GCGGTCGGCG TCACCGCCTG CGGCACGGTC GACAGTCCGC TGCCGGCCCA GGCCCGACCC
CTGGCCCCCG CGACGCCGAC GCCCCCGCGG CCCGCCGCCC TCACGCCTGC CGCCCTCACG
CCTGCCGCCG CGGCCGTGAC GCCGACACCC ACCTGCGACG ATCCGCAGGC CAGCTGGCGC
CCGCCGGGCC GGCTGCCCTC GCCGGGCGCG ATGCCCACGG GCACCGTCCT GGAGACGATC
GAACGCCGCG GCTACCTCAT CGCCGGGGTG CTGGCCGACG TCCCGCCCTT CGGGTCGATC
AGCCCGTTCA CCGGCCAGTT CGAGGGCTTC GACGTGGAGA TCGCCAACCT GGTCGGCCGG
AGGATCTTCG GCGCGGACGG ACACGTGCGG TTCCGCGCGG TCACCTACGC CGAGCGCATC
CCGGTCCTGC GCGACGGCGC CGTCGACGTC GTCGTGGCGA CCATGACGAC GAACTGCGAG
CGGCGCGCGC TGGTGGACTT CTCCGCCGTC TACTACAACG AGACGCAACG GGTCCTGGTC
CCCCGCGACT CGCCGTACCA GGGGATGGAC GATCTCGGTG GGCGACGGGT CTGCACGGCG
GCCGGCGCGG CGGCCGGCGC GGCGGCGACC ATCCGGCGGG CACCGTCGCG CCCGGTGCTG
CGCACCGTGC CGAACATCGC CGACTGCCTC GTGCTGCTCC AGGCCGGGGA GGTCGACGCC
GTCATGACCA CCACGGCCAT CCTCAACGGG ATGGCCGCCC AGGACCCGCG GCTGTACGTT
GTCGGACCGG CCCTGTCGGA CGAGCCGGAC GCGGTCGCCG TCAGCCTCGA CCATCCGGAA
CTGACCCGCT TCGTCAACGG CGTGCTGGCC CGCGCGATCG CCGACGGCAC CTGGAAACGG
CTGGCCCGGC GGTGGCTGTC GGCACCGTTC GCCCCGCCGC CGACGCCACC GGTCGCCCGC
TACCGGGACT GA
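
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch: `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as plain text in the 10-base groups used in this record. Only the first row of the sequence is used as sample input here, so the printed value describes that row alone, not the whole gene. Pass the full 1032 bp sequence to compare against the listed figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (the spaces and line breaks used for grouping) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First row of the gene sequence in this record (60 bases), as sample input.
first_row = "ATGATGATCA TCCGGGTCGC GACGGCGCGA GCGGTGACGG TACTGATTAC CGTCGCCCTG"
print(f"{gc_percent(first_row):.1f}%")
```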
 
Protein sequence
MMIIRVATAR AVTVLITVAL AVGVTACGTV DSPLPAQARP LAPATPTPPR PAALTPAALT 
PAAAAVTPTP TCDDPQASWR PPGRLPSPGA MPTGTVLETI ERRGYLIAGV LADVPPFGSI
SPFTGQFEGF DVEIANLVGR RIFGADGHVR FRAVTYAERI PVLRDGAVDV VVATMTTNCE
RRALVDFSAV YYNETQRVLV PRDSPYQGMD DLGGRRVCTA AGAAAGAAAT IRRAPSRPVL
RTVPNIADCL VLLQAGEVDA VMTTTAILNG MAAQDPRLYV VGPALSDEPD AVAVSLDHPE
LTRFVNGVLA RAIADGTWKR LARRWLSAPF APPPTPPVAR YRD
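
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough sketch, the code below builds a codon table from the standard NCBI amino acid string, whose codon-to-amino-acid assignments table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` helper is hypothetical and stops at the first stop codon. The inputs are the first and last rows of the gene sequence in this record.

```python
from itertools import product

# NCBI amino acid string for the standard code, with codons ordered over the
# bases T, C, A, G (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace used for grouping is ignored; a trailing partial codon is dropped.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGATGATCA TCCGGGTCGC GACGGCGCGA GCGGTGACGG TACTGATTAC CGTCGCCCTG"
last_row = "TACCGGGACT GA"
print(translate(first_row))  # MMIIRVATARAVTVLITVAL
print(translate(last_row))   # YRD
```

The two outputs match the first 20 and last 3 residues of the protein sequence listed in this record.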