Gene Francci3_0314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0314 
Symbol 
ID3903346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp361643 
End bp362797 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content71% 
IMG OID637877643 
Productamidohydrolase 2 
Protein accessionYP_479430 
Protein GI86739030 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.384708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCCGCC CGGTGGAACC GTTCGTAGAT CACGAGCTCG TCGACCATCA CTGCCATGGT 
CTGGTCGGCC GCGATCTGAC TCGCCCGGAA TTCGAAAGCA TGATCACTGA GGCGGAACAC
CCCGGACCAC CGGGGACGAC GCTGTTCGAC AGCCAGATCG GCTTCGCCCT GCGGCGCTGG
TGCGCGCCGG TGCTCGACCT GCCGGCGCAT GCCGCGCCGG AAGACTACCT CGAACGCCGG
GCGGAGCTTG GCCACGCCGA GGTGCACCGG CGGCTGCTGC GTGCCTGCGG GATCACCACG
TTCTGCGTTG ACGCCGGATT CCAACCGGAG CCGCTGACCA GCGCCGCGGA GCTGGCCGAG
CTCGCCGGCG GACGCGGCGT GGACGTGGTG CGGTTGGAGC GGGTCGCCGA GCAGGCCGCG
GTCGACGTCA TCACCGGGCA GATCGGCATG ACCCACCTCG CCGACACCGT GCGGGCCCGG
CTCGAGTCCA CCCGCCCCAC CATGGTCGCG GTGAAGTCGG TCGCGGCCTA CCGCGGCGGC
CTGGAGCTGC CCGCACAGCG ACCGACCGAC CGCGAGGTCG CCGCCGCCGC TCGCGGCTGG
ATCCACGAGA TCCGCGCCGG CGCCCCGATC CGGCTGACCG ACCCCACCCT GCATGCCTTC
CTCATCTGGT GCGGGGTCGA CGCGCGGCTC CCCATCCAGA TCCACGTCGG ATACGGGGAC
AGCGACATCG ACCTGCGCCG GGGCAACCCA CTGCTACTCA CCGGCCTGCT GCGGGCGATA
GCACCGACCG AGGTCAGTGT GCTGCTGCTG CATTGCTACC CGTTCCACCG GGAGGCTGGC
TACCTGGCCC AGGTGTTCCC GAACATCTAC CTTGACCTCG GGCTCGCCAT CCACAACTGC
GGCCGCGGCT CGGCCGGCCT GCTCGCGCAG GCACTCGAAC TCGCCCCGTT CGGCAAGTTC
CTCTTCTCCA CCGACGCCTT CGCGCTCGGC GAGCTGTACC TGCTCGGGGC CGTCCTGTTC
CGGCGTGGGC TGTCGGCGTT CCTCGCGGCG GGCGTCGCCG ACGACGCCTG GACCGCGGCG
GACGCCAAGC GGGTCGCCCG GCTCATCTGC GCGGACAACG CCCGCCGCGT CTACAACCTG
TACAACCCGC CCTGA
 
Protein sequence
MGRPVEPFVD HELVDHHCHG LVGRDLTRPE FESMITEAEH PGPPGTTLFD SQIGFALRRW 
CAPVLDLPAH AAPEDYLERR AELGHAEVHR RLLRACGITT FCVDAGFQPE PLTSAAELAE
LAGGRGVDVV RLERVAEQAA VDVITGQIGM THLADTVRAR LESTRPTMVA VKSVAAYRGG
LELPAQRPTD REVAAAARGW IHEIRAGAPI RLTDPTLHAF LIWCGVDARL PIQIHVGYGD
SDIDLRRGNP LLLTGLLRAI APTEVSVLLL HCYPFHREAG YLAQVFPNIY LDLGLAIHNC
GRGSAGLLAQ ALELAPFGKF LFSTDAFALG ELYLLGAVLF RRGLSAFLAA GVADDAWTAA
DAKRVARLIC ADNARRVYNL YNPP