Gene Francci3_1498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1498 
Symbol 
ID3904964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1792574 
End bp1793860 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content75% 
IMG OID637878835 
Productbiotin carboxylase-like 
Protein accessionYP_480603 
Protein GI86740203 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGTCC TGGTGATCGG ATTCGGGCAG GCGTTGCTCA ACCAGCTGGG TTCGGCGCTG 
CCGCCGGGCA GTGTCACCCT CGTCGAGGAC CCGGATCTGA TCGACCGGCG GCGAGTGCGG
CCGGCTGTCG AGAAGCTGCC CTGCGTGGCG GCCCTGCTCC CGGCGGCCTA CCACCAGAGC
GAGGACTTCC TCCCCGTCGT GCTCGCAGCG GCGGCCACCG GGGGACCCTT CACCGCCGTC
GTCCCGGCCC TGGAGTACAG CGTCCCCGCC GCGGCCTTGG CCGCGGCCGC ACTGGGCCTG
CCGGGCGCGG GAGTCGGGGC CGCCCGATGC CTGTCGGACA AGATCGAACT GCGCCGGGCG
GCGACGGCGG CGGGCATCGC GACCCCGCGC TGGCGGGAGG TCTTCGGCCC GGATGACCTC
GCGGCCTTCG GGGTGGAGAG CCCGGTGGTG CTCAAACCCG CCAACCGGCA CGGCAGTCTC
GGCGTGCAGA TCCTCGACGC GGTGGAGGAC GGGCCGCTCG ACCTGGCGGC CATCTGGTCG
GCGACGGTCG CCGCCCTCGA CGACGGCGCG CTTCCCAACC GCCCGTTGCA CTGGCGCTAC
CTGGTGGAGA GCCGGATGGT CGGCGCGGAG TTCAGCGTGG AGGCGCTGGT GCGCGACGGC
GTGCCGAGCT TCGTCAACGT GACGGCGAAG CGGGTGCTGC CCGGGCGGCA CCCGGTCGAG
ACCGGGCACG TCGTGCCGGC CCCCGGACCC GCGGCCAGGA CCGCCGTGCT GAGCCGCGCG
TTGACGCGTC TCATCGCGGC GATCGGCTTC GGGGACGGCA TCCTGCACGC CGAATGGATG
CTGACCGCCC GCTGCCCGGA CGACCCGGTG CTCATCGAGT GCGCCGGGCG GATCCCCGGT
GACAGTCTGG TCGAACTCAT CGACTTGGCC TACGGAACCT CGCTGGCCGC GGACTACCTG
GAGATTCTTT CCGGGGGGCG GCCCGCGCCG GCCGCGGTGG CGAGCGCCGC TGCGGCGATC
CGCTTCCTGA GTGCCTCACC CGGTCGGGTC GACGCGGTCG ACGGCGTCGA GGTCGCGCGG
GCGGTGCCCG GGGTGCGGCG GGTGGCGCTG GCGGTCACCC CGGGCACGTC CATCCCGCCG
CTGCGCTCGT CGGGCGACCG CATCGGCGAG GTGTTGGCGG TCGGCCCGAC CCCGATGGAT
GCCGAGGCCA CGGCGGCACG CGCCGCCGGG CTGGTGCGCG TGGCGGCGGC AGGGGCTCCG
TTCGGCGCGG CTGTCATGCG GATCTAA
 
Protein sequence
MRVLVIGFGQ ALLNQLGSAL PPGSVTLVED PDLIDRRRVR PAVEKLPCVA ALLPAAYHQS 
EDFLPVVLAA AATGGPFTAV VPALEYSVPA AALAAAALGL PGAGVGAARC LSDKIELRRA
ATAAGIATPR WREVFGPDDL AAFGVESPVV LKPANRHGSL GVQILDAVED GPLDLAAIWS
ATVAALDDGA LPNRPLHWRY LVESRMVGAE FSVEALVRDG VPSFVNVTAK RVLPGRHPVE
TGHVVPAPGP AARTAVLSRA LTRLIAAIGF GDGILHAEWM LTARCPDDPV LIECAGRIPG
DSLVELIDLA YGTSLAADYL EILSGGRPAP AAVASAAAAI RFLSASPGRV DAVDGVEVAR
AVPGVRRVAL AVTPGTSIPP LRSSGDRIGE VLAVGPTPMD AEATAARAAG LVRVAAAGAP
FGAAVMRI