Gene Francci3_2361 details

Gene Information

Locus tag: Francci3_2361
Symbol:
ID: 3904568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2735443
End bp: 2737239
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 76%
IMG OID: 637879691
Product: hypothetical protein
Protein accession: YP_481457
Protein GI: 86741057
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
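
The coordinate and length fields above are consistent with each other. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 2735443
END_BP = 2737239
GENE_LENGTH_BP = 1797
PROTEIN_LENGTH_AA = 598


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 2737239 - 2735443 + 1 = 1797 bp, and 1797 / 3 = 599 codons, one of which is the stop codon, leaving 598 residues.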


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.210389
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.401092
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGGCC GCGGGCCGGT CCTGCCCTGG GTGATCCTCG GCGGCCTGTG GGGCAGCATC 
GGCGTCGGCT GGCTGGCATG GATGACCGTC CGCCTGGCGG CCGTCGTCGG GGGCGGCCAT
CCACCGGCGT TCGGCGCGTT CGTCACCGCC GTGCTCGCCG GCGACGCCAC CCGGGCCACC
GGCGCAACCC CGGCCGGGTG GGTTGTGGTG TTCGCCGTGC TGGCCCTCGC CGCCGCCACC
GCCCTGGCCG TCGTCGCCGT GCGCGCCGTG CGCCGGGCAC GGCGGCGTCG GCGCCGCTCG
GCACCCTGGC GGCTTGCCCT GCCCTCCCTC GCCGACCCGG CCGACCTGGC CACGCTCACC
CCTGCCGGCG CCGCCGACCG TGCCCGCGCG CTGCGCCCCT CCCTGTCGGA TGCCGATGCC
CGCCGGCTCG GTGACGACGC CGGCCTGCTC CTCGGTGACC TGCTGCCCCG CGGGCTGCCG
CTACGCGCAT CCTGGGAGGA CGTGCTCCTG GCGGTCATGG CTCCCCGGGC CGGGAAAACC
ACGGCGCTGG CGATCCCGAT GACCCTCGCC GCGCCGGGCC CGGTACTCGC CACCGGCAAC
AAGGCCGACC TGTGGGCCGC CACCGCACAG GTGCGGGCCG GCGACGGCCG CCGGGTGTGG
ACGTTCGACC CGCAGGCCAT CGCCCACGCC CCCCAGACCT GGTGGTGGAA CCCGCTCGCC
GCCGTGCACG CCGTCGAGGA CGCCGACCGG CTCGCCGGGC ATTTCCTCCA GGAGATCCGC
GGGGAGAAAA CCGGCGGGGA CTTCTGGCAG GCCGCCGCCG GGGACCTGCT CGCCGCCCTG
TTCCTCGCCG CCGCCACCAG CGGGCGCACC CTGCTCGACG TCTACGAATG GCTCAACGAC
TCCGCCAGCC CCGTCCCCGC CGAACTCCTC GCCGCCGGCG GCTACCCGGC CGTCGCCGCC
GGGCTGCGCG GACGGCAGGC CGGGGCACCG GAAACCCGTG AAGGTGTCTA CGAAACCGCC
CGCGCCGCCG CCCGCTGCCT GCGTAACGAC CGGATCCTCG CCTGGGTCAC CCCCGGCCAC
ACCGACCGCC GCCTCGACGT GGCCGCGATC CCGGCCGGCC GCGACGTGCT GCACCTGCTG
TCCAAGACCG ACGAAGGTGC CGCCTCCCCG CTGGTCGCCG CGCTGACCGA CCAGATCGTC
CGCGCGGCCG TCGTCGCCGC CGAACGTTCC GGTGGCCGCC TCGACCCGCC GCTCGCCCTC
GTCCTCGACG AAGCCGCGAA CATCTGCAAG ATCGCCGATC TGCCCGACCT GTACTCCCAT
CTCGGCAGCC GCGGCATCGT CCCCCTGACG ATCCTGCAGT CCTACCGCCA GGGCGTGCGC
GTGTGGGGCG AGGCCGGCAT GGACGCCCTG TGGTCGGCGG CCACCATCAA AATCATCGGC
GCGGGGATCG ACGACCCGCG CCTCGCCGAG GACCTCTCCC GCCTCGTCGG CGACCACGAC
GTCGACACCA CCTCGGTCAC CCGCTCCGCC CAGGGCGCCT CGTCGACCAT CTCCAGCCGC
CGCCAGCGCA TCCTGGAGGC CGCCGACATC CGCGCCATCC CCAAAGGCCG CGCGCTGCTG
CTCGCCACCG GCTCCCGCAT CGCCGCGATC GCCCTGCGCC CCTGGTACAC CGGCCCCCGC
GCCACCGAGA TCACCGCCGC CATCCGTACC GCCGAAGCCA CCCTGACCGC CCGCGCCACC
GGCGCCCACC CCGCCGGCGA GGAGGAGACC GATGACAGCC CCCTCACCCA CCTCTGA
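
The GC content field in the gene information could be recomputed from the sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene sequence. How the database rounds the value is not stated in the record, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence, pasted with its 10-base grouping.
    first_row = "GTGAACGGCC GCGGGCCGGT CCTGCCCTGG GTGATCCTCG GCGGCCTGTG GGGCAGCATC"
    print(f"{gc_content(first_row):.1f}%")
```

Pasting all rows of the gene sequence into one string gives the value for the whole gene, because the function strips the spaces and line breaks used for grouping.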
 
Protein sequence
MNGRGPVLPW VILGGLWGSI GVGWLAWMTV RLAAVVGGGH PPAFGAFVTA VLAGDATRAT 
GATPAGWVVV FAVLALAAAT ALAVVAVRAV RRARRRRRRS APWRLALPSL ADPADLATLT
PAGAADRARA LRPSLSDADA RRLGDDAGLL LGDLLPRGLP LRASWEDVLL AVMAPRAGKT
TALAIPMTLA APGPVLATGN KADLWAATAQ VRAGDGRRVW TFDPQAIAHA PQTWWWNPLA
AVHAVEDADR LAGHFLQEIR GEKTGGDFWQ AAAGDLLAAL FLAAATSGRT LLDVYEWLND
SASPVPAELL AAGGYPAVAA GLRGRQAGAP ETREGVYETA RAAARCLRND RILAWVTPGH
TDRRLDVAAI PAGRDVLHLL SKTDEGAASP LVAALTDQIV RAAVVAAERS GGRLDPPLAL
VLDEAANICK IADLPDLYSH LGSRGIVPLT ILQSYRQGVR VWGEAGMDAL WSAATIKIIG
AGIDDPRLAE DLSRLVGDHD VDTTSVTRSA QGASSTISSR RQRILEAADI RAIPKGRALL
LATGSRIAAI ALRPWYTGPR ATEITAAIRT AEATLTARAT GAHPAGEEET DDSPLTHL
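
The protein sequence corresponds to the gene sequence read under translation table 11, where the initial GTG codon is read as methionine. The sketch below illustrates this under stated assumptions: the codon-to-residue assignments are the standard ones (which table 11 shares), and the start-codon set is a partial list of commonly used bacterial starts, not the full table 11 definition. `translate` is a hypothetical helper, not a database function.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the alternative start codons allowed in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a CDS from its first base up to the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("GTGAACGGCC GCGGGCCGGT CCTGCCCTGG"))
```

The first ten codons of the gene sequence translate to MNGRGPVLPW, matching the start of the protein sequence.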