Gene Francci3_2909 details

Gene Information

Locus tag: Francci3_2909
Symbol:
ID: 3903973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3423987
End bp: 3426194
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 74%
IMG OID: 637880230
Product: peptidase S9, prolyl oligopeptidase active site region
Protein accession: YP_481996
Protein GI: 86741596
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
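
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3423987
end_bp = 3426194
gene_length_bp = 2208
protein_length_aa = 735

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute
# an amino acid to the protein.
total_codons = span // 3
coding_codons = total_codons - 1

print(span, span % 3, coding_codons)  # prints: 2208 0 735
```

Under those assumptions the span matches the listed gene length, divides evenly into codons, and yields the listed protein length.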


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.390736
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGAC CCGCGGTGGT GGCGCAGCCG GTGGTGGGAC GGCCGGTGGT GGGACGGCCG 
GTGGTGGCGC ACACCGTCGA CTCCGGGCGT CCCGGCGATC TGTCCGACGC CTGGGTCGCG
GCGGCGAGCT GTCGCGCACC GAGTACCGGC CCGCATGGTC GGCTCGCCTT CGTCGGCGAT
CGCGGCGGCG CCCCCGCGCT GTGGATCCGG GACCCGGATG GCCGGGAGCG CCCGGTTGAC
ACCGGCCCCG GTCATGTCCG GGCGGTCCTG TGGTCGCCGA CCGACGACCG GATCGCCGTC
CACGTCGCGC CGGGCGGCGG GGAGCTCACC GAGGTCCGGA CCGTCCGGGC GGACGGCGGT
GACCTGCGCC ACCTGGCCGG CGGCGGGAGG CGGGCCGCCA CCCCAGCCCG CTGGACCGAC
GACGGCCACG GCCTCGTCGT GACCGAGTCC GACCGCTCCG ATCCGTCCGG CCGCACCGCC
GCGGTGCTGA TCGACTCCGA CGGGCGACGG ACCCGCCTCG CCGCCGGAGT GGCGCTGCAG
GTGTGCGACG TCGTCGTTCT CCCCGACCCC GCGCCGGACG GGACGGGGAG CGGCGGTCCG
GGGCGGGGCC CGGCCGGTGG CGGGTACCGC CTGCTGCTGC GGGAGGGCCC CCGTGGCGTC
CGGCGGGTGC TCGTGGTCGA CACCATGATC GCAGCCAGGG AGGGGATCGC AGCCAGGGAG
GGGATCGCAG CCAGGGAGGG GATCGCAGCC AGGGAGGGGA TCGCAGCCAG GGAGGGGATC
GCAGCCAGGG AGGGGATCGG CGGGTCATGG CAGGTGCCCG GCGGGACGGC GACCACCGTG
ACCGGTCGGT TCGTCGCCGA CGGCCGTCGC CTGCTGCTGC TGTGCGACAT CGGGCGGGAC
CGGGCCGCGT TGTTCGACGT CCCCGTCGAC CCCACGGGCG ATCCGACCGG CGGGCCGCGG
GTCCTCGCCG CCCGGGATGA CGCCGATCTG GAGCGGTTCG CCCTGCTCGA TCCGGATCGG
GCGGTCGTCG TCTGGAACGT GGGCGGGCGC AGCGAGCTCG CGCTGGTCGA TCTGCCCACC
GGACGGTCAC TGACGCTGCC GCCGCTTCCC CGCGACGTCG TCACCGGCGT GCTCGTCCGG
CCCGGGGGGC GGGAGCTCGT GCTTGCCCTG GATGGTGCCA CCAGGCCGGG GGAGATCTGG
ACCTGGGATC TCACCGCGCC ATCCGCCGGC TACCGTCGGC TGATCTCGCA CGCTCCCGTC
GAGGCAGTGA CCATCCCGGG ACCGCGGGTC GTCCTGAACA TCGACGTTGA CGGCCTGGGA
TCCGACAGCC CGGGGCCATC CGGGCCATCC GGGCCATCCA TGCCATCCGG GCTATCCGCG
CCGGCGCGTG TACCCGGGAT CCCCGGGCCC CGCACCGGTG ACCACCCGCG GCTGGTCCGT
CCGGTGCCGT GCGCCCTGCC GGCCCACGAC GGCCGGGAAC TGTCCGGTTG GTGGTACCGC
CCGCACGGCC CGCGAGGACC GGTCCCGACC CTGCTCCACC TGCACGGCGG CCCGGAGGCG
CAGGAGCGTC CCGTCTACAA CCCGCTGTTC CAGGCCGTGC TCGCCCGGGG GATCGCGGTG
TTCGCGCCGA ACGTGCGCGG CTCCACCGGA TTCGGCCGGT CGTTCGAGGA GGCCGATCAC
ACCCATCGTC GGTTCGGCGG CATCGCCGAC GTCCGCAGCT GCGTGGCGCA TCTGGTCGCG
ACGGGCCTGG CTGACCCGGA CCGGATCGGC GTCGCCGGCC GCTCCTACGG CGGCTATCTG
ACGCTGGCGG CGATGGTCCA CTTCCCGGAG CTGTTCCGGG TCGGGGTAGA CGTCTGCGGG
ATGGTTGATC TGGAGAGCTT CTATCAGTAC ACCGAGCCGT GGATCGCCGC GTCCGCGGTC
ACCAAGTACG GTGATCCACG CACCGAGCCG GCGTTGCTGC GGGCGCTGTC CCCGCTGCAC
CGGATGAGCG CCCTCGCCGC GCCGTTGCTC GTCGTGCATG GGGAGAACGA CACCAACGTC
CCGGTGATCG AGGCGGAGCA GACGGTCGCC GCGGCCCTCG CTCGCGGCGT CGACTGCCGT
TACCTGCTCT TTCCCGGCGA GGGGCACGAG ATCGCGGATC TGCGCCACCG CAGGTCGTTC
GTCCGAGCGG TCGTCGACTG GCTGACCCCC CGGCTGCTGA CCCCCTGA
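
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 bases per line. A minimal sketch of how such text could be joined into one string and summarized is shown below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the excerpt uses only the first and last lines of the sequence, so its GC fraction is not the figure reported for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Excerpt: first and last lines of the gene sequence in this record.
# These two lines are not contiguous in the gene; they serve only to
# show the start and the end of the sequence.
excerpt = """
ATGATCCGAC CCGCGGTGGT GGCGCAGCCG GTGGTGGGAC GGCCGGTGGT GGGACGGCCG
GTCCGAGCGG TCGTCGACTG GCTGACCCCC CGGCTGCTGA CCCCCTGA
"""

seq = join_blocks(excerpt)
first_codon = seq[:3]   # "ATG"
last_codon = seq[-3:]   # "TGA"
print(first_codon, last_codon, len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full sequence text would give the complete gene length and its overall GC fraction.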
 
Protein sequence
MIRPAVVAQP VVGRPVVGRP VVAHTVDSGR PGDLSDAWVA AASCRAPSTG PHGRLAFVGD 
RGGAPALWIR DPDGRERPVD TGPGHVRAVL WSPTDDRIAV HVAPGGGELT EVRTVRADGG
DLRHLAGGGR RAATPARWTD DGHGLVVTES DRSDPSGRTA AVLIDSDGRR TRLAAGVALQ
VCDVVVLPDP APDGTGSGGP GRGPAGGGYR LLLREGPRGV RRVLVVDTMI AAREGIAARE
GIAAREGIAA REGIAAREGI AAREGIGGSW QVPGGTATTV TGRFVADGRR LLLLCDIGRD
RAALFDVPVD PTGDPTGGPR VLAARDDADL ERFALLDPDR AVVVWNVGGR SELALVDLPT
GRSLTLPPLP RDVVTGVLVR PGGRELVLAL DGATRPGEIW TWDLTAPSAG YRRLISHAPV
EAVTIPGPRV VLNIDVDGLG SDSPGPSGPS GPSMPSGLSA PARVPGIPGP RTGDHPRLVR
PVPCALPAHD GRELSGWWYR PHGPRGPVPT LLHLHGGPEA QERPVYNPLF QAVLARGIAV
FAPNVRGSTG FGRSFEEADH THRRFGGIAD VRSCVAHLVA TGLADPDRIG VAGRSYGGYL
TLAAMVHFPE LFRVGVDVCG MVDLESFYQY TEPWIAASAV TKYGDPRTEP ALLRALSPLH
RMSALAAPLL VVHGENDTNV PVIEAEQTVA AALARGVDCR YLLFPGEGHE IADLRHRRSF
VRAVVDWLTP RLLTP