Gene Francci3_4198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4198 
Symbol 
ID3907163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5011160 
End bp5013094 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content71% 
IMG OID637881526 
Producthypothetical protein 
Protein accessionYP_483275 
Protein GI86742875 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGC CCTCCAAGGT CGCCTTCGCC CTTCCCATAC CGGCTCTTGT GCGGGTGCGC 
GGAGCGATCG ACGAGGCCTG GCACGAGGCG CTCGCCGACC TTCCGGACGC TGTGGGCAGC
ACCGTGTCCC TGGGCATCGG GTGGGACCTC GCCTGGGAGC GCGAGCAGTG GCGATCGGCT
GTCGAGGACA GGCGCAGCCA CCTCTCCGTG CGCCTGTACG CGGACGAGAC TCTGATCGGC
CCCCTGTGGG CGCCCGCGAC CGATGCCGGA TGCAGCGGAT GCGCCGAGGT CCGCTCCCGG
GTGGTTCGCG CCCACCCGAT GGTCGAGGCA CTGCAAGTGC CCACCGAGGT CCCGCTTCCT
CGGTCTCCCC TGCTGCCAGA GCTGCTGGCG GCCGCCGTAC AACACCTCAC CGCCCAGCCT
CTCGGGCCCG GTGAGCTCTA CGCGGTCGGC ACCGGGTCAA CCCGCCGCCA CCGGGTTCCA
CGCAGCTCCG GCTGCCCTCT GTGCGGCGCC CGCACCACTG GCACCGCAAC CGCTCCCCCG
CCCGGCAGGC TGGTGCTGCA CGACCACCCG GCGGACCCTC ACGACCCCAC CCGGGGCCTC
ACGGGCCGCC GGTTGCTCGC CCCCGGCGCC CTGCGCCGTC GTACCGTCGA CCCGCGGTTC
GGCCCAGTAC GGGAAGTCGT CCGCGAGTCG CGCGCCCCCT ACGCGATGAG CATGGCGGCA
CTGCCCAACG CACCCGCCAT GGGATACGCG CGGGCCGTCG ACTTCGAAAC GGCCGAACCG
GTCGCGGTCC TGGAGGCTTA CGAGCGGCTG GGAGGCTTTC CGTACGAGGC TTCGGTGATC
GAGGACGTGG CCTACCAGGA GGTCGCAGAA CACGCCGTGG ATCCCGCCTC ACTCGGCGGG
TACACCGCGC AGCAACTGGC CCATCCGAGC ACTCGCGTGA CCCCGAGCTT CCCGTCCACG
CCCATGGACT GGGTCTGGGG CCACGACCTC GCCACCGGCA GACCCCTGCT TGTCCCCGCG
GACATCGGCT TCTACCAGTA CGACCACCGC TTCAAACGCT CGCACCATGC CGCGCAGCGA
GCCGCCCCGC ACGACCGTCG CCGCTATTTC CACGACTCGT CGAGTGGCTG CGCGCTGGGC
GGAAGCCTGG AAGAGGCGGC ACTGCACTCA CTGTTCGAAC TGGCCGAACG CGATGCCTTC
CTCATTGCCT GGCACTGTGC CGTCCCGCTG CCCGCCATCG ACCCGGCCTC CATCACCGAC
CCGGCCAGCC GTCGGTTGCT CGACCTGATC GACTCGCGGG GGTTCGACGC CCATCTGCTG
GTCGCCACCC AGGACATCGA CCTACCCGTG GTGTGGGCGC TCGCCATGAA CCGTGAGCGG
CACTTCCCGG CCACCTTCTC GGCCGCCGGG TCTGGCTGCA ATCCGGCGTC CGTGGTGCGC
AGTGCCCTGT GGGAGCTCGG CCAGATCGTC ACCGACCCGG TCACCTGGAC CAGAGCCGAC
ATCGAGCCCA TGCTCGCAGA CCCCTGGCTG GTCGAGGAAC TCGACGACCA CCTGCGGCTC
TACACCCTTC CTCAGACGCT CGGACGGGTC ACCCCGGTGC TTGGTGGTCT GCGGGTCCCT
CTCGACGAAG CGTTTCCCGG ATGGCCCGAC CGGCTGCGCG AGGAGGCGAA GGGCAGCGTG
CTCAGGGCGC TGCGAGCCAT GCAAGAACGT TTCGCCCGCG CCGGTCTGGA CCGGATCGTG
CTGGTCGACC AGTCCACCCG GGAACACCGG GACCTTCAGG TCGCCGTCGC CAAGGCGGTG
GTGCCGGGAA TCATCCCCAT GTGCTTCGGC CACGCGCAGC AGCGGCTGCT GGGCCTGCCC
CGGCTGACCG CAGCGCTCGC GGGCACGCCA ACGGCCGACC GGCCCTGCCC TTATGACCCT
CATCCGTTCC CGTGA
 
Protein sequence
MPEPSKVAFA LPIPALVRVR GAIDEAWHEA LADLPDAVGS TVSLGIGWDL AWEREQWRSA 
VEDRRSHLSV RLYADETLIG PLWAPATDAG CSGCAEVRSR VVRAHPMVEA LQVPTEVPLP
RSPLLPELLA AAVQHLTAQP LGPGELYAVG TGSTRRHRVP RSSGCPLCGA RTTGTATAPP
PGRLVLHDHP ADPHDPTRGL TGRRLLAPGA LRRRTVDPRF GPVREVVRES RAPYAMSMAA
LPNAPAMGYA RAVDFETAEP VAVLEAYERL GGFPYEASVI EDVAYQEVAE HAVDPASLGG
YTAQQLAHPS TRVTPSFPST PMDWVWGHDL ATGRPLLVPA DIGFYQYDHR FKRSHHAAQR
AAPHDRRRYF HDSSSGCALG GSLEEAALHS LFELAERDAF LIAWHCAVPL PAIDPASITD
PASRRLLDLI DSRGFDAHLL VATQDIDLPV VWALAMNRER HFPATFSAAG SGCNPASVVR
SALWELGQIV TDPVTWTRAD IEPMLADPWL VEELDDHLRL YTLPQTLGRV TPVLGGLRVP
LDEAFPGWPD RLREEAKGSV LRALRAMQER FARAGLDRIV LVDQSTREHR DLQVAVAKAV
VPGIIPMCFG HAQQRLLGLP RLTAALAGTP TADRPCPYDP HPFP