Gene Franean1_2548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2548 
Symbol 
ID5670942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3028956 
End bp3030845 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content71% 
IMG OID641241464 
ProductType IV secretory pathway VirB4 protein-like protein 
Protein accessionYP_001506884 
Protein GI158314376 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0379852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAC GAACCCGACG CCGCGCCTCC ATGCAGGCCC ACAGCCCGTC GACACAGCCG 
GTGAACGCCG CCGCGGCGGC GTTCGTCCCG GACGCACTCT CGATCGCCCC CCGCCATCTC
GACGTCGGTG GGGACTTCCT CGCCACCATG GCCATCACCG GCTATCCCCG CGAAGTCCAC
GCCGGCTGGC TCGCCCCGCT GCTGACCTAC CCTGGCCGGG TCGACGTCGC CGTGCACGTC
GAGCCGATCG ACCCGGTCAC CGCCGCGAAC CGGCTCCGCC GGCAGCTGTC GAAGCTGGAG
TCCGGCCGCC AGCTCGGCGA CGAGAAAGGC CGGCTGATCG ACCCGCAGGT GGAGGCGGCA
ACCGAGGACG CCTACGACCT GTCCGCCCGC GTCGCCCGCG GCGAAGGCAA GCTCTTCAGG
CTTGGTCTGT ACCTCACCGT CCACGCGGGC AGCGAAACCG AGCTCGCCGA CGAGGTCGCC
GCTGTCCGTG CGCTGGCCGC CAGCCTGTTG TTGGACGCCA AACCGACCAG CTACCGGTCC
CTGCAAGGCT GGGTCAGCAC CCTGCCCTTG GGCCTAGACC AGGTACGGAT GCGCCGCACC
TTCGACACCG CAGCGTTGAG TGCTGCGTTC CCGTTCACCA GTCCGGACCT GCCGCCCGCC
GACCCCACGT CGTTGGCGGC GACCGGGGTG CTCTACGGGC TCAACGTCGC CAGCAACGGG
CTGGTGCACT TCGACCGCTT CGGCGACGTC GACAACCACA ACGCCGTGCT CTTCGGTCGT
AGCGGCGCGG GGAAGAGCTA TCTGGCCAAG CTCGAACTGT TGCGCTCGCT GTACCGGGGC
ATCGAGGTCC ACGTCGTCGA TCCCGAAGAC GAATACGCCC GACTCGCCAC CGCGGTCGGC
GCGACCTATC TGCACCTTGG CGCCGACAAC GTGCGGGTCA ACCCGTTCGA CCTGCCGATC
CAGACCACCC CCGACGGGCG GCGGACAGCA CCCCGTGACG CCCTGGTGCG CCGCAGCCTG
TTCCTGCACA CCGTCGTCGC GGTGCTCGTC GGTCAGCTGT CCGCGGCTGA ACGGGCAGTC
CTCGACGTCG CGATCACCGC CACCTACCAG ACGGCGGGGA TCAGCTCCGA CCCACGCACC
TGGAGCCGAC CGGCACCGCT GCTGGCCGAC CTCGCCACGA CCCTGGCCGC CTCCGACGAC
CCGGCGGCAG TGACCCTCGG TGCCCGGCTG CACCCGTACA CGGCAGGGGC GTTCTCCGGC
CTGTTCGACG GCCCTACCAG TGCGCCTGGC GACGGCCACC TCGTCGTCTA CTCCCTGCGC
GATCTGCCCG ACGAACTCAA AGCCATCGGC ACGCTGCTCG TCCTCGACGC CGTGTGGCGG
CGGGTGTCCA ACCCCGCCGA CCGCCGACCC CGCATGGTCG TGGTCGACGA GGCGTGGCTG
CTGATGCGCC AACCGGCCGG TGCGGACTTC CTGTTCCGGA TGGCCAAGAG CGCGCGGAAG
TATTGGGCCG GGCTGACCGT CGCGACCCAG GACACCGCCG ACGTGCTCGC CACCGACCTG
GGCAAGGCGA TCGTCACGAA CGCCGCCACC CAGATCCTGC TGCGCCAGGC ACCGCAGGCG
ATCGACGAGA TCACCGCCGT GTTCGACCTG TCCCAGGGCG AACGGCAGTT CCTGCTCTCC
GCCGACCGCG GACAGGGACT CCTCGCGGCG GGGGCACAGC GGGTCGCCTT CCAGGCCCTG
GCCTCGCCCA GCGAGCACCG CCTGGTCACG ACCAACCCCG CCGAACTCGC CGCCGACCCC
GACGAGGCCG GCGACGACGG CTTCTTCGAC CTCGCCGCGC CCGCTGGCCC GGCCGATGAC
GACGGCCAGA TCTACCTCGA CGCCGCCTGA
 
Protein sequence
MSRRTRRRAS MQAHSPSTQP VNAAAAAFVP DALSIAPRHL DVGGDFLATM AITGYPREVH 
AGWLAPLLTY PGRVDVAVHV EPIDPVTAAN RLRRQLSKLE SGRQLGDEKG RLIDPQVEAA
TEDAYDLSAR VARGEGKLFR LGLYLTVHAG SETELADEVA AVRALAASLL LDAKPTSYRS
LQGWVSTLPL GLDQVRMRRT FDTAALSAAF PFTSPDLPPA DPTSLAATGV LYGLNVASNG
LVHFDRFGDV DNHNAVLFGR SGAGKSYLAK LELLRSLYRG IEVHVVDPED EYARLATAVG
ATYLHLGADN VRVNPFDLPI QTTPDGRRTA PRDALVRRSL FLHTVVAVLV GQLSAAERAV
LDVAITATYQ TAGISSDPRT WSRPAPLLAD LATTLAASDD PAAVTLGARL HPYTAGAFSG
LFDGPTSAPG DGHLVVYSLR DLPDELKAIG TLLVLDAVWR RVSNPADRRP RMVVVDEAWL
LMRQPAGADF LFRMAKSARK YWAGLTVATQ DTADVLATDL GKAIVTNAAT QILLRQAPQA
IDEITAVFDL SQGERQFLLS ADRGQGLLAA GAQRVAFQAL ASPSEHRLVT TNPAELAADP
DEAGDDGFFD LAAPAGPADD DGQIYLDAA