Gene Franean1_1381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1381 
Symbol 
ID5669789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1671183 
End bp1673153 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content73% 
IMG OID641240307 
ProductType IV secretory pathway VirD4 protein-like protein 
Protein accessionYP_001505734 
Protein GI158313226 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGG AACGCGGCCG ATTCGGTGAT GCCGGCTTGG ACGCGGAGAT CGCAGTGGTC 
GCCGTGGCGC TCCTCGTCGC GTCGTCGGCG GCGGCAGTCT GGGCAGGCGC GCAGCTGGCC
GCGTTGGCGT TCGGTGCGCA CCGCCCCCTC GACGTGGGCC TGGGCGACGG GCTGGCCGCG
CTACCTCAGC TGATCGCCGA CCCTGCCCAT CCGGCGCACG CCTGGCCGGC GCGCGCCGCA
GGCGACCTGC CGGGGCAGCT CGGGTACTGG GCGGCGACCG TGGTGCCGCT CGCGGTACTG
GCCATGCTGA GCGGGCTCGC GGCCTTCCTG CTCGTCGGCG ACGGGGTGGG CGTGGCGCGC
CGGCGCCGGA TGGGTATCGA TCCCGAAGCC CGGTTCGCCC GCCCGGCGGA TCTGGCACCG
CTGTGGCTGG GCGGGCCGAC CCGCGGTCGG ATGATCCTGG GCCGGATCGG TGGCCCCCGG
GGACGGCTCG TCGCCACCGA GGACACCAAC CGTCCACTGG ACGCGGCGGT GCCCAGATGG
CGAGCACGAC GGGCAGCCCG TCGCCGCGGG CAGCGCGGCA GCGTGATCGT GCTAGGCCCG
TCCCAGTGCG GCAAGACCGC CGCCCTGGCG ATCCCCGCCA TCCTGGAATG GGACGGCCCG
CTGATCGCCC TGAGTGTGAA GAACGACCTG CTGGGTGCGA CGATCTCCCG CCGTCGGCAG
GTCGGCGACG TCGCGGTGTT CGACCCGGCG GGCGTCACCG GCGAACTCGG CGCGCCCTGG
TCCCCGCTGG GTGCGGCGCG CACCCTGGCC GGCGCGCGCC GCGCCGCCCG CTCGATCGCG
AACGCCACCT CCTGGACGTC GGCCTCGTCG GGCGACATGG GCTTCTGGAC CGCGGCGGCC
GAGGACCTCC TCGGCCAGCT GTTCTGGACC GCCGCCGTCG TCGACCTGGG CATGGACACC
GTCGTGAGCT GGGTGGTGTC CATGGACAAG GAGACCGTCC GCGGCCTGCT GACCCCGCTG
GCCTCCCACC GCGACCCGAC ACTCGCCGCG GACGGTACGC AGGTCCTCGC CGGCTTCCAG
GGGATCTGGG CGAACGACCG CCGGCAGATC TCCTCCACCT ACCTGGTGGC CCGGCAGATG
ATCCAACCCT GGCAGGAACC GGAGATCGCT GCCTCCGCCA CGGCGTCGCA TCTTGATCTT
GAATGGCTCC TCGACACCGG CCCCGACGGG CAGGCTGCGA ACACGCTGTA CCTGAGCGCG
GATCTCGACG ACGCAGAACG CCTGGCCCCC GTGCTCGGTG GCCTGCTCGA CGACCTGATG
CGCCAGGCCT ACAGCCACGT CGGGCGAACC GGCGTCCCGT TGGACCCGCC GCTGCTGGTG
GTCGTGGACG AAGCCGGGAA CTGGCCGATG CGCAACCTAC CGGGACGGAT CTCCACCTGT
GCCGGCATCG GTATCCAGCT GGTGCTGGTG TATCAGAGCA AGGCGCAGAT CGACGCCGCC
TACGGCCCGA AAGCCGACAT CGTCATCTCC AACGCGGTCA CCAAGGTCTT CTTCGCCGGC
CAGTCCGACC GCTCCACGCT CGAGTACGCC GCCGGCCTGC TCGGCCAGGA GCATGTCGTC
CAGACGTCCA CCAACGTCGA CAGCACTGGT CTGGTCGGCC CGTCCGGCCG GCGCGGGGTC
TCGCGCAGCC CCACCCGGGT GGAGCTGCTG CCCTCCGCGC TGCTGCGGCA GGTCGCCCCC
GGCCAGGCGC TGCTCGTCCA CAACACCCTT CCGCCCGCCC ACCTGTTCGG CCGCTACTGG
TACCTGGACG AGGACCTGCA CGCACTCGCC ACCGGCCACC GGACCTCCCG ACGCGACCTG
GTCCGCCAGG CCGCCCGCCA GCGGATCACT CCCGGGAACA GCCCTGACCG GCCGCCGCCC
CCAACCGCCT CTGGGGATGA ACAATCGGCG GCTCCGTGGT GGCCCTGGTG A
 
Protein sequence
MSVERGRFGD AGLDAEIAVV AVALLVASSA AAVWAGAQLA ALAFGAHRPL DVGLGDGLAA 
LPQLIADPAH PAHAWPARAA GDLPGQLGYW AATVVPLAVL AMLSGLAAFL LVGDGVGVAR
RRRMGIDPEA RFARPADLAP LWLGGPTRGR MILGRIGGPR GRLVATEDTN RPLDAAVPRW
RARRAARRRG QRGSVIVLGP SQCGKTAALA IPAILEWDGP LIALSVKNDL LGATISRRRQ
VGDVAVFDPA GVTGELGAPW SPLGAARTLA GARRAARSIA NATSWTSASS GDMGFWTAAA
EDLLGQLFWT AAVVDLGMDT VVSWVVSMDK ETVRGLLTPL ASHRDPTLAA DGTQVLAGFQ
GIWANDRRQI SSTYLVARQM IQPWQEPEIA ASATASHLDL EWLLDTGPDG QAANTLYLSA
DLDDAERLAP VLGGLLDDLM RQAYSHVGRT GVPLDPPLLV VVDEAGNWPM RNLPGRISTC
AGIGIQLVLV YQSKAQIDAA YGPKADIVIS NAVTKVFFAG QSDRSTLEYA AGLLGQEHVV
QTSTNVDSTG LVGPSGRRGV SRSPTRVELL PSALLRQVAP GQALLVHNTL PPAHLFGRYW
YLDEDLHALA TGHRTSRRDL VRQAARQRIT PGNSPDRPPP PTASGDEQSA APWWPW