Gene Francci3_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2191 
Symbol 
ID3906791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2564221 
End bp2565186 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content66% 
IMG OID637879523 
Productprolyl aminopeptidase 
Protein accessionYP_481289 
Protein GI86740889 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.73475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCAGC CGTACCGGCC TATCGAGCCA TACGCGCACG GCTTCCTCGA TGTAGGCGAG 
GACAACCAGA TCTATTGGGA AACCAGCGGC AACCCAGACG GCAAGCCGGC GTTGTGGGTG
CACGGAGGTC CCGGCAGCGG CGGGCGCCGC GGCAGCCGCA GGATGTTTGA TCCGGACGTC
TACCGGATCA TCCTGTTCGA CCAGCGTGGC TGCGGCCAGA GCCGACCGCA CGCCAGCGAC
GCGACCGTCA GCCTGGAACA CAACACCACC CGGCACCTGA TCGCCGACAT GGAGAGGCTC
CGCGAACATC TCGGCATCGA CCGGTGGCTC TTTTACGGCA GCTCATGGGG CTCGACGCTG
ATACTCGCCT ACGCCGAGCG CTACCCTGAA CGGGTCTCCG AGATCATCAT CGTCGGAGTC
ACCATGACCC GGCCCGAAGA AATCGACTGG CTCTACCGCG GCGTCGGGCG CCTGCTGCCA
GCCGCCTGGG AAGCGTTCCG CGACGCCGTG CCGAAGGACG ACTGGGACGG CAGTCTCGTT
GCTGCCTACA ACCGGCTGTT GGGCAGCCCC GACGAGGCGA TAAGAATCAC CGCGGCGCGG
GCCTGGTGCG CCTGGGAAGA CGCCGTCATC GCGCACGAAG CGCTTGGGGC GCCAGGCCAG
TACAGCAGCA AACCCGACGA TGCGCTCGTG GCCTTCGTCC GCATCTGCAC GCACTACTTT
GCAAACAACG CCTGGCTGGA GGACGGGCAG TTGCTACGCG ACGCTCACCG GCTGGCCGGG
ATTCCCTCGG TGTTGATCCA CGGCCGACTT GACCTTGCCA GCCCGCTCAA GACCGCCTGG
GATCTCGCCA AGGCATGGCC CGACGCGGAA CTGAAGATCA TCGACAATGC AGGCCACACG
GGCAGCCCCG CGACCCAGAA CGCCATCATC GAGGCGACCG AACGGTTCAG CACGAACGGA
CACTGA
 
Protein sequence
MGQPYRPIEP YAHGFLDVGE DNQIYWETSG NPDGKPALWV HGGPGSGGRR GSRRMFDPDV 
YRIILFDQRG CGQSRPHASD ATVSLEHNTT RHLIADMERL REHLGIDRWL FYGSSWGSTL
ILAYAERYPE RVSEIIIVGV TMTRPEEIDW LYRGVGRLLP AAWEAFRDAV PKDDWDGSLV
AAYNRLLGSP DEAIRITAAR AWCAWEDAVI AHEALGAPGQ YSSKPDDALV AFVRICTHYF
ANNAWLEDGQ LLRDAHRLAG IPSVLIHGRL DLASPLKTAW DLAKAWPDAE LKIIDNAGHT
GSPATQNAII EATERFSTNG H