Gene Francci3_4233 details


Gene Information

Locus tag: Francci3_4233
Symbol:
ID: 3907199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5050565
End bp: 5051902
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 71%
IMG OID: 637881559
Product: hypothetical protein
Protein accession: YP_483308
Protein GI: 86742908
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family
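
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts one terminal stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(5050565, 5051902)
print(length, protein_length_aa(length))  # prints "1338 445"
```

Under these assumptions the result agrees with the listed Gene Length (1338 bp) and Protein Length (445 aa).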


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.855014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTCG CCGCGATCCT CGCCCCCCGA ACCGGGATCA TCCGCCGGAT CGAACGCAAC 
GCCATTCCCG CCACCCTGCC GCCCGAATTC ACCATGTACA CCGCCGTTCT CTCCGACACC
ACCAGGTTCT CCGCCTGGGC CAGTGACTTC GCCGGAGCCG GCTACGCCCT GCTCGACGAC
GACGCCGCCC TCGGGCCCGC CGTCGGCGAG GCCGTCGAAC GCTACTGCGG GAACCTCGTC
CCCGCCGGAC TGCGCCGCGC CACCCACAAA GAGCTCACGG CGGACCGCGC CACCGTGCTT
GACCCGCGCT CGGTGGTGCT GTACTCACCC GCCCAGTACG CCGGGCCCTA CTTCCCGTTC
ACCGAGTACC GCGAGGATCT CGAGCTGGAA TGGACGTCCG GCACCGACCT GCTTGCTGGC
ACCCCCGTCT GGGTACCCGC GCAGCTCGTC TGGGTGTCCT ACGCCCACCA GGCCCAGGCC
CGCGGGTTCC CCTACCTCAG CCCGGTCCTC AGCGCCGGCC TGGCCGCCGG TATGGACCAG
CGCTCGGCGC AATGGTCGGC GATCTGCCAG ATGATCGAAC GTGACACGCT GACCATGGCC
TGGCACGGCC GCCGGCCGCT GCGGGCCATC ACCCCGCCCC CCTGGATCGC GCAGCTTGCC
ATGGGCGCGT ACGGAAGCAT GACGACCCGG TTCGTCGAAT TCCCGAACGA GTTCGGCCTC
GTCGTCGTCG GCGCCCTGGT GCACGACACG GCGACCGGCT ACCTGACCAT GGGAGCAGCG
TGCCGGACCA CCACCACGCC GGCGCTGCGC AAGGCCCTTG CCGAAGCCTT CCAGCTGCAG
ATGTTCGTCG CCGACCTCGA CGACTCCGAC GGCCCCTACA TGCGCGCCGC ACGCAACCCG
CACAGCCCCC TCAAGCCGTG GCGAGCTGAC CGACGGTACC TCGACGACTG CCGAGACGAC
CTGGCGGACG TCGTCGAGTA CTGCACCCAC CTCCAGCTCT TCCTCGACTC TCGCCTGCAG
GACCGGCTGG AGGCCGAACT CGCCGAGGCC CTCACCGGCA CGATCGGCTG GGAGACCCTC
GACCGGGACG CCCGGCACGC CGGAGTCGAC GACCCGACGG TGCTCGCGCG CACGCTCGCC
GACGCCGGCC ACCCGGTGAC ATCGGTCGAC GTCACCACCG AAGACGTGCG CCCCACCGGC
ATGCGGGTCG TGCACACCTT GGCCCCCGGC CTGTACTCGA ACACCTCTGT CGGCCTTCCG
TTCCTCGGCG GCGCCCGGCT GGCCAGGCAA CTCGCCGCGG CGGGCACCAC CCGGCGCGAC
CTCCCGCTGC CGCATTAG
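
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and only the first sequence line is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in a non-empty DNA string."""
    if not dna:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as printed in this record.
first_line = "ATGGACCTCG CCGCGATCCT CGCCCCCCGA ACCGGGATCA TCCGCCGGAT CGAACGCAAC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the complete gene sequence instead of the first line would give a value to compare against the listed GC content.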
 
Protein sequence
MDLAAILAPR TGIIRRIERN AIPATLPPEF TMYTAVLSDT TRFSAWASDF AGAGYALLDD 
DAALGPAVGE AVERYCGNLV PAGLRRATHK ELTADRATVL DPRSVVLYSP AQYAGPYFPF
TEYREDLELE WTSGTDLLAG TPVWVPAQLV WVSYAHQAQA RGFPYLSPVL SAGLAAGMDQ
RSAQWSAICQ MIERDTLTMA WHGRRPLRAI TPPPWIAQLA MGAYGSMTTR FVEFPNEFGL
VVVGALVHDT ATGYLTMGAA CRTTTTPALR KALAEAFQLQ MFVADLDDSD GPYMRAARNP
HSPLKPWRAD RRYLDDCRDD LADVVEYCTH LQLFLDSRLQ DRLEAELAEA LTGTIGWETL
DRDARHAGVD DPTVLARTLA DAGHPVTSVD VTTEDVRPTG MRVVHTLAPG LYSNTSVGLP
FLGGARLARQ LAAAGTTRRD LPLPH