Gene Francci3_3574 details


Gene Information

Locus tag: Francci3_3574
Symbol:
ID: 3904513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4272113
End bp: 4273297
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 66%
IMG OID: 637880895
Product: peptidase M50
Protein accession: YP_482655
Protein GI: 86742255
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID: [TIGR00054] RIP metalloprotease RseP
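
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the record states neither, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4272113
end_bp = 4273297
gene_length_bp = 1185
protein_length_aa = 394

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codons = gene_length_bp // 3

print(span == gene_length_bp)           # coordinate span matches the gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop equals the protein length
```

Under these assumptions all three checks print `True`.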


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCTTC TCGGCATTGC GGCGTTCGCA CTGGCCCTGC TGGTGTCGGT CGTCGCGCAC 
GAGGCCGGGC ATTTCGTGAC GGCCCGGCAC TACGGCATGA AGGCCTCCAA GTTCTTCGTC
GGTTTCGGGC CGACCATCTG GTCCCGGCGG CGGGGGGAGA CCGAGTACGG TGTCAAGGCC
ATTCCGGCCG GCGGTTTCGT CAAGATCGAG GGGATGACTC CGCTCGAGGA GATCGATCCG
GCCGATGAAC CCCGCGCCTT CCACAATGCG CGGGCGCGGG CCCGGCTCGT GGTCATGTCG
GCCGGTTCCT TCGTGCATTT CGTCATCGCC ATCGTGCTGG TCTACGGAGT GCTCGTTGTC
CTGGGCACGA CCACGATCAG CGAGTCGAGG GTCGGCGCGA CGAGTTGCAT CGCCACGACC
GCGACTTGTT CCGGACCGGG GCCGGCCGCG GCGGCCGGTC TGCGGCCGGG TGACCGGATC
GTCAGCTTCG GCGGAGTTCC GGTCACGACC TGGACGCAGT TCACCCGGCA GGTGCGTGCG
CACGGAGCGG GGCCTGCGGT GATGGTCGTC GAACGGGACG GCCGCACCCT CACTCTCACG
CCGAACCTGG TGGAGGTCCG GCGCGATCGG GAGACCGGGC AGGCGGGCGA CGACCGGGTC
GGCGCCTTGG GCGTCAAACC GGGAACCGAG ACAGTGCACT ACAACCCGAT CGAAGCGGTG
CCCCGCACCT TCGATGTCAT CGGGTCCGGG TTCACCGGCA TGTACGAAAC GCTGACCCGC
CGGATCGGTG ATATCGGTAA TATCTTCAGC GACAACCGCG ACCCCCAGGG TTTCATCAGC
GTGGTGGGAG CGGCGCGTAT CGGCGGTGAC GTGGTCTCGG CCGAGGGCAG TTCGGCCGTG
GACCGGGTGC GGAACCTTCT CATTCTGGTC GCCGCGATCA ATCTCGCGGT CGGAATTTTT
AACCTGTTGC CCCTACTCCC GTTGGACGGC GGTCATATTG CCGTGCTGGG CTTCGAGCAG
GCCCGGCACG GTCTACGCAG GCTCCGGGGT TATCGCGGTC CGGTGCAGAA GGTGGATTTC
GCCAAACTGT TACCAGCCAC GTACGCCACG GTCGTCGTAT TGCTCGGGTT CAGTCTGCTT
GTCCTGTCCG CCGACATCGT CAATCCCATT CGCCTGAATC AGTAA
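
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any calculation. The following is a minimal sketch of how one might strip the layout and compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGAGCTTC TCGGCATTGC GGCGTTCGCA CTGGCCCTGC TGGTGTCGGT CGTCGCGCAC"
seq = clean_sequence(first_row)

print(len(seq))                   # number of bases in this row
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```

Passing the full 1185 bp sequence through the same two functions would give a value to compare with the GC content listed in the record.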
 
Protein sequence
MELLGIAAFA LALLVSVVAH EAGHFVTARH YGMKASKFFV GFGPTIWSRR RGETEYGVKA 
IPAGGFVKIE GMTPLEEIDP ADEPRAFHNA RARARLVVMS AGSFVHFVIA IVLVYGVLVV
LGTTTISESR VGATSCIATT ATCSGPGPAA AAGLRPGDRI VSFGGVPVTT WTQFTRQVRA
HGAGPAVMVV ERDGRTLTLT PNLVEVRRDR ETGQAGDDRV GALGVKPGTE TVHYNPIEAV
PRTFDVIGSG FTGMYETLTR RIGDIGNIFS DNRDPQGFIS VVGAARIGGD VVSAEGSSAV
DRVRNLLILV AAINLAVGIF NLLPLLPLDG GHIAVLGFEQ ARHGLRRLRG YRGPVQKVDF
AKLLPATYAT VVVLLGFSLL VLSADIVNPI RLNQ
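
The protein sequence can be derived from the gene sequence with the codon assignments of translation table 11. As a rough sketch, the code below builds the codon table from the compact TCAG-ordered amino acid string (table 11 shares its codon-to-amino-acid assignments with the standard code and differs in its permitted start codons) and translates the first and last codons of the gene. The `translate` function is a hypothetical helper written for illustration; it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate dna codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGAGCTTC TCGGCATTGC GGCGTTCGCA"))  # MELLGIAAFA
print(translate("CGCCTGAATC AGTAA"))                  # RLNQ
```

The two outputs match the first ten and last four residues of the protein sequence listed above.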