Gene Francci3_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0404 
Symbol 
ID3903646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp475165 
End bp476595 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content72% 
IMG OID637877733 
Producthypothetical protein 
Protein accessionYP_479520 
Protein GI86739120 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGT GCAAGACCGG ATGGCATCTC GACGACGAAC CCCTTCCCGA CCCGGATCCG 
CCGTCCGACG ACGCCAAGGC ACTTGCCGTG CTGCGGGCGG ACCTGGAGGA AGCCCGCGGT
GAGGTCCTGG CGGCCGGCGA CCTCGGGCAG GTCGGCGAGA TCGACGAACT GATCGGCCAG
GTGGACCGGG AACTGTCAGA TCTGGGCGTG CGGGGCCGGA TCGCGCCGGC CGACCGGGAC
CGGCCGCGCC GCGTCCGCTC GACCCGTCGT CGCCAGGATG CCCCCGATCT GCCCCGGCTG
CCCGTTGAGC GGCGCACGGT CGGGCGGACG TTCACCGCGC CGGACGGGAC GGTGTGGCGG
CCGTCGATGT TCCTGACCCT GACCTGTGCC TCGTACGGGC GGGTGCACTC CGACGGCACG
CCGGTGGACC CGGCCTCCTA CGACTACCGG CAGGCGGCAC GGGACGCGAT CCACTTCCCG
AAGCTGCTGG ATCGGTTCTG GCAGAACCTG CGCCGCGCGG TCGGCTGGGA TGTCCAGTAC
TTCGCCGCGC TGGAACCCCA ACGGCGTCTC GCCCCGCACC TGCACGCCGC GATCCGCGGC
ACCATCCCGC GGACCATGCT GCGGCTGGTG GCGGCAGCCA CCTACCACCA GGTCTGGCGG
CCAGCGACGG ACCGGCCGGT CTACGACGAT CAGCATCTGC CCGTCTGGGA CGACACCATC
GGCGCCTACC TCGACCCCGA CGCCGGCGAC CCGCTGCCCT CGTGGGATGC GGCGCTCGAC
GCGATCGGTG AGGACGACGA ACCCGCGCAC GTCGTGCGGT TCGGTCCGCA ACTCCAGGCG
GACGGGGTGA CGGCGAACTC GGTGAACACC GGCCGGATGA TCGGCTATCT CACCAAGTAC
CTGACGAAGA CCCTCGACAC CTGTCACGAG ATCAGCAGTG ACCGGCAGCG GGCGCATGTG
GAGCGGCTCG CCGATGCCCT GCGCTACGAA CCCTGCTCCC CGACCTGTGC GAACTGGCTG
CGCTACGGCA TCCAACCCCG CAACCCCCGG CGCGGCCTCA CCCCCGGACG GTGCACCGGC
AAAGCCCACC GGCGCGAAAC CCTCGGCTTC GGCGGCCGGC GGGTGCTGGT CTCCCGCCGC
TGGTCTGGCA AGACCCTCAC CGACCACCGT CGTGATCGGG TCGTCTTTAT CCGCCAGCAG
CTCGCGGCAC TGGGCGCCAC TGGCACCGGA CCGGCCGCGC CCGAGGACGA TCCGACCCGC
ATCGCCTGGA CGCTGCTACG CCCCGGGGAC CCGGCCGCAC CCCGCCGCGA ACATCTGATC
TTGCATGCCA TCGCCCAACG GCACGCCTGG CGCGCACAGC TCGGAATCGG CACCGAGGAA
GGGGCCGGCG GGGATTCGGC AACAGGGCCG CCGCTGGCGG ACGCAGCCTG A
 
Protein sequence
MAQCKTGWHL DDEPLPDPDP PSDDAKALAV LRADLEEARG EVLAAGDLGQ VGEIDELIGQ 
VDRELSDLGV RGRIAPADRD RPRRVRSTRR RQDAPDLPRL PVERRTVGRT FTAPDGTVWR
PSMFLTLTCA SYGRVHSDGT PVDPASYDYR QAARDAIHFP KLLDRFWQNL RRAVGWDVQY
FAALEPQRRL APHLHAAIRG TIPRTMLRLV AAATYHQVWR PATDRPVYDD QHLPVWDDTI
GAYLDPDAGD PLPSWDAALD AIGEDDEPAH VVRFGPQLQA DGVTANSVNT GRMIGYLTKY
LTKTLDTCHE ISSDRQRAHV ERLADALRYE PCSPTCANWL RYGIQPRNPR RGLTPGRCTG
KAHRRETLGF GGRRVLVSRR WSGKTLTDHR RDRVVFIRQQ LAALGATGTG PAAPEDDPTR
IAWTLLRPGD PAAPRREHLI LHAIAQRHAW RAQLGIGTEE GAGGDSATGP PLADAA