Gene Francci3_2662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2662 
Symbol 
ID3904886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3142594 
End bp3143664 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content75% 
IMG OID637879987 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_481753 
Protein GI86741353 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0480971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCGG AGCAGGGCCC GGTCGTGCTG GGCATCGAGA CGTCCTGCGA CGAGACCGGG 
GTGGGGCTCG TCCGGAACGG CACGCTGCTG GGCGAGGCCC TGTCGACGAG CATGGACCAG
CACGCCCGCT ACGGCGGGGT GGTCCCCGAG ATCGCCGCCC GGGCCCACGT GCAGGCGCTG
GTGCCCTGCG TGCGCGCGGC GCTGTCCTCG GCGGGGCTGT TCGTGGCGGA CATCGGCGCC
GTCGCGGTCA CCGCCGGCCC CGGCCTCGCC ACCGCCCTGC ACGTCGGGGT GGCCGCGGCG
AAGGCGTACG CCACGGCGCT CGATGTTCCC CTCTACGGCG TGCACCATCT CGCCGGTCAC
CTCGCCGCGG ACCTCGTCGA CGGCGAACCG CTACCCGATC CCCTCATCGC CCTGATCGTC
TCCGGCGGGC ACACGTCGCT GCTGCGGGTG GGGGACCTCG CTCGCGACCC GATCACCCAC
CTCGGCGACA CGCTTGACGA CGCGGCCGGG GAGTGCTTCG ACAAGGTCGC CCGGGTGCTC
GGCCTGCCCT ATCCGGGCGG TCCCGCGGTC GACCGAGCCG CGGTCGGCCA CGATGCGACG
GCGCTGGCCT TCCCCCGGCC GCTGACCGGC CGGGCGGACG CGCCCTACAC CTTCTCGTTC
TCGGGGCTGA AGACCGCCGT CGCCCGATGG GTCGAGTCGC ATCCCGACTC CCCCGTACCG
GCCGGCGATG TGATCGCATC CTTCCAGGAG GCAGTCGTCG ACGTGCTCAC CGCCAAGGCG
GTCCGTGCCT GCCTCGACCA CGGGATCGGT GACCTGCTCA TCGTCGGCGG GGTCGCGGCG
AACAGCCGGC TGCGGGCGCT GGCGGCCAGC CGCTGCGAGC AGACCGGCAT CCGGCTGCGG
ATACCGGCCC GCCGGCGGTG CACGGACAAC GGCGTGATGA TCGCGGCGTT GGGTGACCTG
CTCGTCCGCG CCGGCGCCGA GCCCTCCCCC GCCGAGCTCA CCGCCATGCC GGGCGCGTTC
CTCGAACGGG CCCAGCTCGG CACCGCGCTG CCGGCCCTGC ACGCCGCGTG A
 
Protein sequence
MPPEQGPVVL GIETSCDETG VGLVRNGTLL GEALSTSMDQ HARYGGVVPE IAARAHVQAL 
VPCVRAALSS AGLFVADIGA VAVTAGPGLA TALHVGVAAA KAYATALDVP LYGVHHLAGH
LAADLVDGEP LPDPLIALIV SGGHTSLLRV GDLARDPITH LGDTLDDAAG ECFDKVARVL
GLPYPGGPAV DRAAVGHDAT ALAFPRPLTG RADAPYTFSF SGLKTAVARW VESHPDSPVP
AGDVIASFQE AVVDVLTAKA VRACLDHGIG DLLIVGGVAA NSRLRALAAS RCEQTGIRLR
IPARRRCTDN GVMIAALGDL LVRAGAEPSP AELTAMPGAF LERAQLGTAL PALHAA