Gene Franean1_7239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7239 
Symbol 
ID5675540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8839011 
End bp8840720 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content74% 
IMG OID641246076 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001511464 
Protein GI158318956 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.108633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.212332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCA CGCACGACGC CCGGTCCCCT TGGGCTCCGC CCGACAACGC GCCGGAGCGG 
ACCAGGGCCG AGGCGAACAG TTCGGAGGGG TCGGTGGACC CGCGGTCGAA TCCCGTCGAA
CACCAGCAGT ACCCCGGTAC GCCGCCGGCC GGCGGCCCGC CGCCCGGGCC CGGCCCGAGC
CGTCCTGACC AGACCGGCGG TTTTCCGGCG GGCCCCCCGC CGGCGCCGCA CCCGGCCGGG
CCGGCGCCCG GCACGGGCGC GCCCGGCGGG CCCTACGAGC CGGGCCCGCA GCAGGGCCCC
CGGCCGACCG CGCCCTACGC CCAGACCTCC GGCTGGGCCG ACCCCCGCTC CACCGCCGGC
GCTGGCGACA GCACGCAACG GGTCGACCAG GGCGGCCCGG CCACCGGCCA GGGCCAGCAG
CAGTCCGGCG CGTGGTGGAA CCCCGGGCAT CCCGGCTGGG GCGCCCCGGT GCCGCCCGGA
GCGGCTCCCG GCGGTCAGGC ACCCGGTGGC CAGGCGCCCG GGTCACCGGG CGGCTCCGAT
CCCTACGGGC GGTTCACCCC GGCGAAGCCG AACCCGGCGC CCATGCGCCG GCGTCGGATG
ATCGCGGCCG CGCTGGCGAT CGCGCTCGTG TCGGGCGGCA TCGGCGGCGG CGTCGGCGCA
CTCGTCGCCA GCGACGACTC CCCGGCGGTG GTGACCTCCT CCGCCGGCCT GCGGCAGTCG
ACCGGCACGG CCGGCGTCTC CCCGGCCGCG GACAACACGG TCGCCGCGGC CGCGCAGGCG
ATCCTGCCCA GTGTCGTGAC GATCGCCGAA CAGTCCAGCC AGGAGTCGGG CACCGGCTCC
GGGACGATCA TCCGCGCGGA CGGCTACATC CTCACGAACA ACCACGTGGT GTCCGGCGCC
TCGCAGGGCG GCACCCTCAC GGTCACGATG CAGGACGGCC GGACCTTCGA CGCGCAGGTC
AAGGCGACCG ACCCGAGCTC CGACCTCGCC GTGGTCAAGA TCGACGCTAC CGGCCTGCCC
GCTGCCACGT TCGGGGACTC CGACGCACTG CAGGTCGGCG AGCTCGTCGT CGCTGTCGGC
AGCCCGCTGG GCCTCAACGG GACGGTCACC TCCGGCATCG TCAGCTCGCT GCACCGGCCC
GTCCGCACCG GCGACGCGAC CGTGCGGGAC CAGCAGAACA CCGTCCTGGA CGCGATCCAG
ACCGACGCGC CCATCAATCC CGGCAACTCG GGTGGGCCGC TGGTGAACAG CAAGGGCGAG
ATCATCGGCG TGAACACCGC AATCGCGACG GTCGGCGGCA GTTCACCCTT TGGTGGTAGT
CAGCAGTCCG GAAACATCGG GGTTGGTTTC GCGATTCCGG GCAACTACGC GGAGAAGGTC
GCCGGCCAGC TCGTCGACAA CGGCGCAGCG CAGCACCCCT ATCTGGGTGT GAGCGCCTCC
ACCGCCGACG AGAACACCCG GTCGACGGCC GCGAGTGGGA CGGGCGCGCA GATCCGCTCC
CTGGTCAGTG GAGGCCCGGC AGACAAAGCG GGCCTGCACG TAGGTGACGT CATCACCAAA
GTGGGGGATC GCGCCGTCAC CGACGTGGAT TCGCTGATCG CCGCCGTCCG GTCCTACGAG
ATCGGCAACC AGGTGCAGGT CACCTACCAG CGCGACGGCT CCAGCCAGAC CGCGACGGTC
ACGCTGCTCG AACAACCGCC CAATTCCTGA
 
Protein sequence
MTTTHDARSP WAPPDNAPER TRAEANSSEG SVDPRSNPVE HQQYPGTPPA GGPPPGPGPS 
RPDQTGGFPA GPPPAPHPAG PAPGTGAPGG PYEPGPQQGP RPTAPYAQTS GWADPRSTAG
AGDSTQRVDQ GGPATGQGQQ QSGAWWNPGH PGWGAPVPPG AAPGGQAPGG QAPGSPGGSD
PYGRFTPAKP NPAPMRRRRM IAAALAIALV SGGIGGGVGA LVASDDSPAV VTSSAGLRQS
TGTAGVSPAA DNTVAAAAQA ILPSVVTIAE QSSQESGTGS GTIIRADGYI LTNNHVVSGA
SQGGTLTVTM QDGRTFDAQV KATDPSSDLA VVKIDATGLP AATFGDSDAL QVGELVVAVG
SPLGLNGTVT SGIVSSLHRP VRTGDATVRD QQNTVLDAIQ TDAPINPGNS GGPLVNSKGE
IIGVNTAIAT VGGSSPFGGS QQSGNIGVGF AIPGNYAEKV AGQLVDNGAA QHPYLGVSAS
TADENTRSTA ASGTGAQIRS LVSGGPADKA GLHVGDVITK VGDRAVTDVD SLIAAVRSYE
IGNQVQVTYQ RDGSSQTATV TLLEQPPNS