Gene Francci3_4292 details

Gene Information

Locus tag: Francci3_4292
Symbol:
ID: 3907260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5122032
End bp: 5123216
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 71%
IMG OID: 637881619
Product: colicin V production protein
Protein accession: YP_483367
Protein GI: 86742967
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
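
The Gene Length and Protein Length fields can be cross-checked against the Start bp and End bp coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly, and the variable names are illustrative only.

```python
# Coordinates copied from the record; 1-based inclusive numbering is assumed.
start_bp = 5122032
end_bp = 5123216

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # complete codons, including the stop codon
protein_length = codons - 1           # the stop codon encodes no residue

print(gene_length, protein_length)    # prints: 1185 394
```

Under that assumption the result agrees with the 1185 bp and 394 aa reported in the record.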


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCAACG TCCTGGACAT CGTTCTTCTC CTGCTCGTCG TACTCTTCGC GATCTCCGGG 
TACCGGCAGG GCTTCCTGGT GGGCGCGCTG TCGTTCGTCG GCTTCCTCGG CGGTGGTGTG
CTCGGTGCCA AGGTTGCGAA GCCCTTCGCC GAACTGATCG GCCAGGAGAG CCACGGCGCC
ATGGTCGGTC TCCTGGCCGT GCTCGGTCTC GCCCTGGTCG GGCAGGTCGC GGGCACCGCG
ATCGGGGTCG CCAGCCGGGA CCGGGTGACC TGGCGCCCCG GCAGGGCCGT CGATGCCGCC
GCGGGTGCGG TGCTCTCCGG CTTATCGGTG CTTCTTGTCG CCTGGCTGCT GGCGACGGCC
GTCGATCGAT CGCCCTTCGA GTCCCTCGCC CACTCGGTCC GGGAATCGCG GATCCTCGCC
ACCGTGGACG CCGGGATGCC CAACGAGGTC CGCAGTGCAT TCGCCGACCT GCGCCAACTG
GCTGACGACA ACGGCTTCCC CGAGGTCTTC GCCGGGCTCG GCGGCGGGCG CATCGTCGCT
GCCGATCCGC CCGATCCCGC TTCCACCCGG ACTGCCGGGG TCCGCAACGC GGCGGCGAGC
ATCGTGAAGA TCCGTGGCGT CGCCAGCTCC TGCGATAAAC GGGTCGAGGG ATCCGGGTTC
GTCATCGCCC CGCAGCGCGT GATGACCAAC GCGCACGTCG TAGCCGGAGT GCACCGTCCG
GTGGTCGTGC TGGCCTCGCG CACCCTGCCT GCCGAGGTCG TCCTGTTCGA CCCCGACCGG
GACGTCGCCG TGCTCCGCGT GCCGGACCTG CAACGGCCGC CGCTGCGGTT CCGGACAAAC
CCACCGGCCG CGGCCGAAGA CGCCGCAGTG ATCGCCGGTT ATCCCCAGGA CGGGCCATAC
ACCACTGTGG CGGCGCGCGT CCGCAACCAT CAGACCGCCC AGGCCCCGGA CATCTACTCG
CACGGCCTCG TCCTGCGCGA CATCTACGCG GTCCGTGGAC GGGTGTTGCC GGGAAACTCC
GGTGGACCCC TGTTGTCCGA AAGCGGAGAC GTCCTGGGCG TCGTGTTCGC CGCGGCCGTC
AACGACAGCG ACACCGGTTA CGCCCTGAGC GCCGCCGAAG TGGCCAGGCC TGCCGCCGCC
GGGGTGATCG CCACGGCGGA GGTGAGCACT CAGGGGTGCG ACTGA
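
A GC content figure such as the one in the record can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    bases = "".join(seq.split()).upper()   # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First block of the gene sequence above, used as a small example.
print(gc_content("GTGCTCAACG"))   # prints: 0.6
```

Passing the full gene sequence as one string (line breaks included) gives the fraction for the whole gene.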
 
Protein sequence
MLNVLDIVLL LLVVLFAISG YRQGFLVGAL SFVGFLGGGV LGAKVAKPFA ELIGQESHGA 
MVGLLAVLGL ALVGQVAGTA IGVASRDRVT WRPGRAVDAA AGAVLSGLSV LLVAWLLATA
VDRSPFESLA HSVRESRILA TVDAGMPNEV RSAFADLRQL ADDNGFPEVF AGLGGGRIVA
ADPPDPASTR TAGVRNAAAS IVKIRGVASS CDKRVEGSGF VIAPQRVMTN AHVVAGVHRP
VVVLASRTLP AEVVLFDPDR DVAVLRVPDL QRPPLRFRTN PPAAAEDAAV IAGYPQDGPY
TTVAARVRNH QTAQAPDIYS HGLVLRDIYA VRGRVLPGNS GGPLLSESGD VLGVVFAAAV
NDSDTGYALS AAEVARPAAA GVIATAEVST QGCD
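
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates the idea with a hand-built codon table. `translate_cds` and `START_CODONS` are hypothetical names, and the start codon set is a deliberately reduced assumption, not the complete list for table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initiation codon as M."""
    bases = "".join(seq.split()).upper()   # drop spaces and line breaks
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [bases[i:i + 3] for i in range(0, len(bases), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"                  # initiator codon gives methionine
    protein = "".join(residues)
    # Drop a single trailing stop symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence above.
print(translate_cds("GTGCTCAACG TCCTGGACAT CGTTCTTCTC"))   # prints: MLNVLDIVLL
```

The output matches the first block of the protein sequence in the record.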