Gene Franean1_6728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6728 
Symbol 
ID5675041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8180935 
End bp8182908 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content74% 
IMG OID641245577 
ProductType IV secretory pathway VirB4 protein-like protein 
Protein accessionYP_001510968 
Protein GI158318460 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.726364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.776114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCC GCCGCAACCG CCTACACGTC GCCCCGCAGG GTGCGCCGTC CCGCAGCCGC 
CGGCCCCTGG TCCCCGGCCG CGGGCAAGGG GCGGGGCAGA TGCTGCCCGT CGAGGGCCCG
GCGCTGTTCA CCCCGCCCGC GCTGGTGGTC GATGCCGGGC AGATCGAGGT CAGTGGGATC
TGCGCGACCA CGATCACCGT GGTCGGCTAC CCGCGTGAAG TCGGACCCGG GTGGATGGAG
CCGCTGCTGG CCTACCCGGG CCGCCTCGAT GTCGCCCTGC ACATCGACCC GACCCCACCC
GCGGTGGCGG CGCTGCGACT ACGCCGCCAA CTCGGCCGGC TCGAATCCGG CCGCCGCGCC
GACGCCGCGG CCGGCCGTCT CGCCGACCCC GAGCTCGACG CCGCCGCCCA AGACGCCGGG
GAGCTGGCCC GTCAGGTCGC TCGTGGTGAG GCACGCCTGT TCCGGACCGG GCTGTACCTG
ACCGTCTACG CCGATAGCCG CGAGGAGCTC GCCGAGGAAG CGGCGCGGGT CACCGCGCTG
GCGCACAGTC TGCTGCTCAC GGTGCGCCGG GCCCGGTACC GGAGCGTGCA GGGCTGGGTG
AGCACCCTGC CCCTGGGCCT GGACCTGCTC CAGATCCGCC GGGCGATGGA CACCCAGGCG
CTGGCCGCGG GGATCCCGTT CACCACCCCT GACCTGCCCC TACCCGACGT GGAACGCCCC
GGAGCCGCAC CCGTCGTCTA CGGCACGAAC CTGCACTCCG CCGGGATCGT GGCGCATGAC
CGGTGGGCGC AGCCGAACTA CAACTCCGTC ACCACCGGCG CCTCCGGCGC CGGCAAGAGC
TTCCTGATGA AACTCGACCT CCTGCGCTCC CTCTACCAGG GCGTCGAGGC CGCCATTGTT
GATCCGGAGG ACGAGTACGG CCGGCTCGCC GCCGCTGTCG GGGGCACCCG CCTCGCGCTC
GGCGAGCCTG GGGTGCACCT CAACCCCCTC GACCTGCCCG CCCACTCCCA TCACGACCCC
GACCTCCTCA CCCGCCGCGC CCTGTTCTGC CACACCCTGA TCACCACCCT GCTCGCCGGC
ACCGATGACG ACAGCGGGCT GGGAGCCGGG GGCCGGGCCG CGTTGGACGC GGCGATCCTG
AGCGCCTACC GCGCTGCCGG GATCACCCAC GACCAGGCCA CCTGGACCCG GCCCGCCCCG
CTGCTCGCCG ACATCGCCGC CGCCCTCACC GCCGCTGAGG ACCCGGCCGG CCCAGCACTC
GCCGCACGTC TCGCCCCGTT CGTCACCGGC TCGCATGCTG GCCTGTTCGC GCACGCGACC
ACCACCAGCC CGACCGGGCA CCTTGTCGTC TACTCCCTGC GGGCGCTGCC CGACGAGCTG
AAAGCCGCCG GGACGCTGCT CACCCTGGAC GCGATCTGGC GGACCGTCGC CGACCCGGCC
CGGCGACGGC GCCGTCTCGT GGTGGTGGAC GAGGGCTGGC TGCTCATGGC ACACCCCGCC
GGCGCCCGCT TCCTGTTCCG CCTCGCGAAG GCCGCCCGGA AGCACTGGGC CGGTCTGGCC
GTGGCGACCC AGGACTCCGC CGACCTCCTC AGCTCCGAGC TGGGCCGGGC GGTCGTCGCG
AACGCGGCGA CGCAGATCCT GTTGCGCCAG GACCCGACCG TGATCGACGA CCTGCGTCGC
GTGCACCGGC TCACCGATGG CGAAGCCACC CAGCTGCTCA CCGCCGGTCC CGGTGACGCC
CTGCTCCTGA CTGGGACCGG GCAGCGCACC GCCCTGCACG CCCTCGCGTC CCCCGCCGAA
TACGACCTGA TCACCACAGA TCCGGTCGAC ACCACCACCG CCACTCCCAC CGACACGGGC
CCACTCGACC CGGCCTGGGC CGAGGACCCT GCCGTCGCGG CGACCGGGCC GGCCCGCGCT
CCTCGCCGGC CGGCCGCCCG GCGGCCGGTG GACGACGACG CCGACCCGTT CTAA
 
Protein sequence
MSRRRNRLHV APQGAPSRSR RPLVPGRGQG AGQMLPVEGP ALFTPPALVV DAGQIEVSGI 
CATTITVVGY PREVGPGWME PLLAYPGRLD VALHIDPTPP AVAALRLRRQ LGRLESGRRA
DAAAGRLADP ELDAAAQDAG ELARQVARGE ARLFRTGLYL TVYADSREEL AEEAARVTAL
AHSLLLTVRR ARYRSVQGWV STLPLGLDLL QIRRAMDTQA LAAGIPFTTP DLPLPDVERP
GAAPVVYGTN LHSAGIVAHD RWAQPNYNSV TTGASGAGKS FLMKLDLLRS LYQGVEAAIV
DPEDEYGRLA AAVGGTRLAL GEPGVHLNPL DLPAHSHHDP DLLTRRALFC HTLITTLLAG
TDDDSGLGAG GRAALDAAIL SAYRAAGITH DQATWTRPAP LLADIAAALT AAEDPAGPAL
AARLAPFVTG SHAGLFAHAT TTSPTGHLVV YSLRALPDEL KAAGTLLTLD AIWRTVADPA
RRRRRLVVVD EGWLLMAHPA GARFLFRLAK AARKHWAGLA VATQDSADLL SSELGRAVVA
NAATQILLRQ DPTVIDDLRR VHRLTDGEAT QLLTAGPGDA LLLTGTGQRT ALHALASPAE
YDLITTDPVD TTTATPTDTG PLDPAWAEDP AVAATGPARA PRRPAARRPV DDDADPF