Gene Francci3_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1012 
Symbol 
ID3906699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1206129 
End bp1207571 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content69% 
IMG OID637878345 
Producthypothetical protein 
Protein accessionYP_480124 
Protein GI86739724 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACCCA CCGGAGGTCA CCAGCGCCGG TGCCGGCCGG TGCGGATCTA CGTGGAATTC 
CGTAACCACT ACTCGTTCCT CGCCGACTCG CCCGAGGTCC GCCGGCTGCT GAACAGCTCG
TCGCTGATCT CGGAGCAGCT CGTCCAGCAG CTGCACGCGA ACGGCGTCGA TCCGCTGGAG
CTTCCGCAGT TCAGCGTGGT CCGACCCGGT GACCTGAAGC TGCACGCGGA CGAGCGGGTC
TACTCGGTCT CGCCGTTGGA ACTGCGGCTC GCTCAGCAGA CCGGCCAGCT CGCGCGGGGG
CCCGCCGTGG ATGCGGCCAG GCACCTCGCG GACCTGGTGC CGGCCCGCAT CGGGCAGGTC
CTGAACACCG CCGGGCTCGC TGATCTCCAG CCCAGGCTGG AGAGTTACTT CGGCCCGGCG
TTCCTGGCCG ATCAGGCGGT GTTCACTCTG GCGGACATCT GCACCCACGT CGGCATCGAA
TGCTTCGCCG AGGGACGGGT CACGCTGGAG ACGTTACTGC TCGCGCCCCT GTTCCGTCCG
GCCGGCGCCG GCACGGTGGC GTTGGTGCAC ACCGTTTTCC AGGAGTATTT CGCCGCGCGC
TACCTGCGGA CCGCGGCCGG ACGGGAGGCC GCGGGCAGCA ACGCCGAACC GATTCTGACC
GAGCAGGTCC GGGAGTTCCT CGTCCACCTC GGCGCCGAGA CGATCCCCGC GCCCCCGCGG
ACGCTGCCGG CAGCGACGTA CCTGGTCGGG CCGAGCCACC GCCTCCTTCT TCGAAAGATC
GAGAAGCCGG TGCTGTTCGA CGAGTTCCCC GTGACGGTCG GGCAGTACAA GGCGTTCCTC
GCCGCGGTCG CCGAGCAGGG CTGCGCGCAG TGGGATCATC CGGACACTCC GCCAGGCCAC
AGCCACGAAC CATGGCAGGA ACGCTTGCGC AACCCGGAGT ACTTCACCGA TCCGGTGTAC
GATAACTACC CGGCGAACTG CGTCAACTGG TGGAGTGCCT ACGCCTTCGC CCGCTTCGAG
GGCAAGCGGC TGCCGACCTG CGTCGAGTGG GAGGCCGCCG CCCGGGGCAC GGATGGCCGG
CTGTTCCCCT GGGGAGACGG CGTCGACCTG GCCGCGGTCA ACTGCGCCGA CGCGTGGAGC
GGCCGGCCGT TGGTGACTTA CGAGGTCTGG AAGCAGGAGA TCGACGGCGG CCGGTTGCGG
GACTGCGCCC CGACCCCGGT GACCGATCAT GCGGCGAACC TGTCCCCGTT CGGGGTGCAC
GGCATGTCCG GCAACGTGTG GGAATGGACC GAGACCGTGT TCGACGGGAT CAACTCCGCG
GTGATCTGCG GCGGCTCCTA CGACAACCCG TACCGGGCGG TGCAGACTTC GTCGAAGGCC
CTGTACCTAC GTCGGGGCAG CAGTAACGCC GTCGGGTTCC GCTGCGTGCG GGAGGTCGCG
TGA
 
Protein sequence
MVPTGGHQRR CRPVRIYVEF RNHYSFLADS PEVRRLLNSS SLISEQLVQQ LHANGVDPLE 
LPQFSVVRPG DLKLHADERV YSVSPLELRL AQQTGQLARG PAVDAARHLA DLVPARIGQV
LNTAGLADLQ PRLESYFGPA FLADQAVFTL ADICTHVGIE CFAEGRVTLE TLLLAPLFRP
AGAGTVALVH TVFQEYFAAR YLRTAAGREA AGSNAEPILT EQVREFLVHL GAETIPAPPR
TLPAATYLVG PSHRLLLRKI EKPVLFDEFP VTVGQYKAFL AAVAEQGCAQ WDHPDTPPGH
SHEPWQERLR NPEYFTDPVY DNYPANCVNW WSAYAFARFE GKRLPTCVEW EAAARGTDGR
LFPWGDGVDL AAVNCADAWS GRPLVTYEVW KQEIDGGRLR DCAPTPVTDH AANLSPFGVH
GMSGNVWEWT ETVFDGINSA VICGGSYDNP YRAVQTSSKA LYLRRGSSNA VGFRCVREVA