Gene Franean1_2744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2744 
Symbol 
ID5671135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3246372 
End bp3248264 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content71% 
IMG OID641241656 
ProductType IV secretory pathway VirB4 protein-like protein 
Protein accessionYP_001507076 
Protein GI158314568 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00908166 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAC GATCCCGACG CCGCACCTCG GCGCAGGCCC CCGGCCCGTC GGCAGGGCCG 
CTGGACGCCG CAGCGGCGGC CTTTGTCCCG GACGCGCTGT CGATCGCGCC CCGCCACCTG
GACGTGGGCG GGGACTTCCT TGCCACGATG GCGATCACCG GCTATCCGCG TGAGGTCCAC
GCGGGCTGGC TCGCCCCGCT GGTGACCTAC CCGGGCCGGG TGGATGTCGC GGTGCATGTC
GAGCCGATCG ACCCGGTCAC CGCGGCGAAC CGGCTCCGCC GGCAGCTGGC GAAGCTGGAG
TCCGGCCGCC AGCTCGGCGA TGAGAAGGGC CGGCTGGTCG ACCCACAGGT CGAGGCGGCG
ACCGAGGACG CCTACGACCT GTCCGCCCGC GTCGCCCGCG GCGAAGGGAA ACTCTTCAGG
CTTGGTCTCT ACCTGACCGT CCACGCGTCC AGCGAGAACG AGTTGGCTGA CGAGGTCGCG
GCGGTGCGGG CGCTCGCAGC CAGCCTGTTG CTGGATGCCA AGCCGGTCAG CTACCGGTCG
CTGCAAGGCT GGGTGAGCAC CCTGCCGCTG GGGTTGGACC AGGTGCGGAT GCGCCGCACC
TTTGACACCA CCGCCCTCAG CGCGGCGTTC CCGTTCACCT CACCTGATCT GCCGCCGCCC
GACCCGACCT CGCTGGCCCC GACCGGGGTG CTCTATGGGC TCAACGTCGC CAGCAACGGG
CTGGTGCACT GGGACCGGTT CGGGGATGTC GACAACCACA ACGCGGTCAT CCTCGGCCGC
TCAGGTGCCG GCAAGTCCTA TCTGGTCAAG CTCGAACTCC TGCGCTCGCT GTACCGGGGC
ATCGAGGTCC ACGTCGTCGA CCCGGAAGAC GAATACGCCC GGCTCGCCGC CGCGGTCGGC
GGGACCTACC TGCATCTCGG TGGCGACGGG GTACGGATCA ACCCGTTCGA CCTGCCTATT
CAGACCACTC CCGACGGCCG GCGGACCGCG CCGCGCGACG CGCTGGTGCG GCGCAGCCTG
TTCCTGCACA CCGTGATCTC CGTGCTGGTC GGAGAAATGA CGGCGGCCGA ACGGGCAGCC
CTCGACGCGG CGATCACCGC CACCTACCAG GCGGCCGGAA TCAGCTCCGA CCCGCGCAGC
TGGAACCGGC CAGCACCCCT GCTGACCGAC CTGGCCGACC TGGCCGCAAC CCTGGCCAGC
TCCACCGGCC CGGCGGCGGC GCTCGCCGCC GGGCTGTACC CCTTCACCCA GGGGGCGTTC
TCCGGCCTGT TCGACGGCCC CACCAGCGCG CCCGGCGACG GCCGCCTGGT CGTCTACTCG
CTGCGCGACC TGCCCGACGA GCTCAAAGCC ATCGGCACCC TGCTCGTCCT GGACGCGATC
TGGCGGCGGG TGTCCAACCC CGCCGACCGC CGGCCCCGCC TGGTCATTGT CGACGAGGCG
TGGCTGCTCA TGCGCCAGCC CGCCGGTGCG GACTTCCTGT TCCGGATGGC GAAGTCCTCC
CGGAAGTACT GGGCCGGGCT CACCGTCGCC ACCCAAGACA CGGCTGACGT GCTGGCCACC
GATCTCGGCC GGGCGATCGT CACGAACGCC GCCACCCAGA TCTTGCTACG GCAGGCACCG
CAGGCGATCG ATGAGATCAC CGCCGTATTT GACCTGTCCC AGGGCGAACG GCAGTTCCTT
TTGGCCGCCG ACCGCGGACA GGGACTCCTC GCGGCCGGGG CACAGCGAGT CGCCTTCCAG
TCCCTCGCCT CCGGGGTCGA ACACGCCCTG ATCACGACAA ACCCGGCCGA ACTCGCGGCA
GACACCGATG GGGCCGACGA CGGCTTCTTC GATCTCGCCA TATCAGACGA CCCGATCGAT
CCCGACGGTC AGATCTACCT CGACCCCGCC TGA
 
Protein sequence
MSRRSRRRTS AQAPGPSAGP LDAAAAAFVP DALSIAPRHL DVGGDFLATM AITGYPREVH 
AGWLAPLVTY PGRVDVAVHV EPIDPVTAAN RLRRQLAKLE SGRQLGDEKG RLVDPQVEAA
TEDAYDLSAR VARGEGKLFR LGLYLTVHAS SENELADEVA AVRALAASLL LDAKPVSYRS
LQGWVSTLPL GLDQVRMRRT FDTTALSAAF PFTSPDLPPP DPTSLAPTGV LYGLNVASNG
LVHWDRFGDV DNHNAVILGR SGAGKSYLVK LELLRSLYRG IEVHVVDPED EYARLAAAVG
GTYLHLGGDG VRINPFDLPI QTTPDGRRTA PRDALVRRSL FLHTVISVLV GEMTAAERAA
LDAAITATYQ AAGISSDPRS WNRPAPLLTD LADLAATLAS STGPAAALAA GLYPFTQGAF
SGLFDGPTSA PGDGRLVVYS LRDLPDELKA IGTLLVLDAI WRRVSNPADR RPRLVIVDEA
WLLMRQPAGA DFLFRMAKSS RKYWAGLTVA TQDTADVLAT DLGRAIVTNA ATQILLRQAP
QAIDEITAVF DLSQGERQFL LAADRGQGLL AAGAQRVAFQ SLASGVEHAL ITTNPAELAA
DTDGADDGFF DLAISDDPID PDGQIYLDPA