Gene Francci3_0394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0394 
Symbol 
ID3903636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp467350 
End bp468570 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content58% 
IMG OID637877723 
Productcupin 4 
Protein accessionYP_479510 
Protein GI86739110 
COG category[S] Function unknown 
COG ID[COG2850] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGATGG ATCATCAGCT CATCCGCGAT ATAAAGACCG CGTTGGCATG GTCCAGACCC 
AGCTCGACAC CCTCCCGGTT CCTACGGGGG ACTCTTCCTG ATTCTGGGAT ATGCCCCAGG
GTGCTCACGC CGACCAGATT TCTAGACCTG ATAATGAGAC GAAGCCTTGT CTCGCCCCAG
ATGCGGTGCT TTCAAAACGA ATCAGAGCTG CATCCAAACT CCTTGCTTCA GATGAACACG
ACGAGGCGAG GGCAGGTCAC TCCGATGGTC GATATGCGCC GGCTGGCCGG ACTTTTGCAA
TCAGGCTGCA CCCTGGTGTT GGACGCGGTC AACCACTTTG ATCCGACGCT AGAGGTTGCG
TGCCGAGCGT TTCAATGGTG GTTGCGCGCG CCTGTGCAGG CGAACGTGTA CCTCACAACG
GGCGACGCGG CGGGTTTCTC CCTACACTGG GATGATCACG ACGTCATCGT CCTACAGTTA
GCTGGCGACA AAGAGTGGGA GGTCCGGGGT CCGTCTCGCC GTGCTCCCAT GTACCGCGAC
GCCGCACCCA ATACTGAGCC TCCCAAAGAC ATCGTGTGGT CTGGTACCGT GAATACCGGT
GACGTGTTGT ATATTCCCCG GGGCCACTGG CATCGGGCGA GCCGCACCAG CAGAGGTGAC
GGGTTTAGCC TTCATGCTAC CTTTGGGTTC ACGAGACGAA CGGGCGTCGA TTGGTTGGCT
TGGCTTGCGG ACCAGTCGCG CCGCGAAGAG GTGTTTCGGG AGGATCTGAA TCAGCGGGGA
GAAGACCCGA AAGAACATCA AAACGACGGC GAGAAAATTA TTGTTGCCGC ATCGCGTCTT
CTTACGTCAC ATCCGCCGGC CCATTACTTG GAATCCGTGG CGCATGCCAC CTCCGCAGGC
CGGTATGTCT CCACAGCCGG CATTTTTGGT CCGCCATCCG CGGTCGTGTG CGTTACTGAT
TTTCCACCTC AGATAGAGAC CCAGGGCGAT ACAGTGGCGG TCGCGACGGC GGAGAAACGG
ATCGTCTTCA CCAGGAAAGC ATTACCAGCC CTTGGGTTGC TTCTGTCGGG CAATCCTGTG
TGCCTTGACT ACGTATCGTC CGCAGCGGGG ATCGATGGCG CGCGCCTTGG GGAGATACTT
GTCCGGGAGG GCATATGCGC GGAACTGACT CCGGAATTAT TCTCGGGCTA TACCGGTCTG
ACCACAGACG GCAAGCTTTA G
 
Protein sequence
MLMDHQLIRD IKTALAWSRP SSTPSRFLRG TLPDSGICPR VLTPTRFLDL IMRRSLVSPQ 
MRCFQNESEL HPNSLLQMNT TRRGQVTPMV DMRRLAGLLQ SGCTLVLDAV NHFDPTLEVA
CRAFQWWLRA PVQANVYLTT GDAAGFSLHW DDHDVIVLQL AGDKEWEVRG PSRRAPMYRD
AAPNTEPPKD IVWSGTVNTG DVLYIPRGHW HRASRTSRGD GFSLHATFGF TRRTGVDWLA
WLADQSRREE VFREDLNQRG EDPKEHQNDG EKIIVAASRL LTSHPPAHYL ESVAHATSAG
RYVSTAGIFG PPSAVVCVTD FPPQIETQGD TVAVATAEKR IVFTRKALPA LGLLLSGNPV
CLDYVSSAAG IDGARLGEIL VREGICAELT PELFSGYTGL TTDGKL