Gene Francci3_2585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2585 
Symbol 
ID3906491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3049097 
End bp3050149 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content76% 
IMG OID637879910 
ProductApbE-like lipoprotein 
Protein accessionYP_481676 
Protein GI86741276 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.501359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.267301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGCG CGGCAGCGGC GGTCGAGTGG TCGGTGTGGA GCACGACCGC CCGGCTCGTG 
GTGACCGAGC CCGGCGCACT GGCCGCCGCC CGGGAGATCG TCGTGGACCA TCTCGCGGCC
GTTGACGAGG TCGCCAGTCG TTTCCGGGCC GATGCGGAGA TCAACCAGCT CGATCACGCC
GATGGGACGC CGCAACGGAT CAGCCCGCTA CTGGCAGACC TCGTCGGTGC GGCGCTCCTC
GCGGCCCGGC GCACCGACGG GGACGTAGAC CCGACCGTCG GCGGACCGCT CGCCGCTCTC
GGCTACGACC GGGACATCAC GTTGCTGCCG TCGGACGGTC CGCGGCTGCG GGTGGTGCAC
CGGCCGGCGC CGGGCTGGCA GCGGATCCGG CTGACCGGGA ACACACTGAC GCTGCCGGAC
GACGCGCGGC TCGACCTCGG AGCGACGGCG AAGGCGCAGG CCGCCGACCG CTGCGCCGCT
CTGGTCGCCG AACGGCTCGA CACCGGCGTG CTGGTCAGCC TCGGCGGGGA TGTCGCCACC
GCCGGGAACG CGCCGGAGGG CGGCTGGCGC ATTCGGGTCC AGGACCGCCC GGGCGAGCCC
GCCTGCACCA TCACCCTGGC CGCCGGGAGC GCGGTCGCGA CGTCCAGCAC CCTCGGGCGG
CGGTGGCGGC GCGGCGGGCG GCTCCTGCAC CACATCCTGG ATCCGCGCAC CTGCCAGCCC
GCGCCGGTGG TCTGGCGGAC CGCCACCGTG GCGGCCGCGA GCTGCCTCGA CGCGAACACG
GCGAGCACCG CCGCGATCGT CCGTGGTTCC GCGGCGGTGG GCTGGCTGCG CCGGCTCGGG
ATGCCGGCAC GGCTCGTCGC CACGGACGGC TCGATCGTCA CGACCGCTGG CTGGCCCGCC
CCGAGCCGGG CCAAGGTGGC CGGAGCCGGA GTGGCCGGAG CCGGAGTGGC CGGGGTCCGG
GTAGACCGGG CCGAGACGGC TCCGCCCGCC GAGGATGGAA CCCGAGTCAC GTCCGCTGGA
TCGGCGACAC GGCCGAGAGG TTCCGGCCGA TGA
 
Protein sequence
MPGAAAAVEW SVWSTTARLV VTEPGALAAA REIVVDHLAA VDEVASRFRA DAEINQLDHA 
DGTPQRISPL LADLVGAALL AARRTDGDVD PTVGGPLAAL GYDRDITLLP SDGPRLRVVH
RPAPGWQRIR LTGNTLTLPD DARLDLGATA KAQAADRCAA LVAERLDTGV LVSLGGDVAT
AGNAPEGGWR IRVQDRPGEP ACTITLAAGS AVATSSTLGR RWRRGGRLLH HILDPRTCQP
APVVWRTATV AAASCLDANT ASTAAIVRGS AAVGWLRRLG MPARLVATDG SIVTTAGWPA
PSRAKVAGAG VAGAGVAGVR VDRAETAPPA EDGTRVTSAG SATRPRGSGR