Gene Franean1_6729 details


Gene Information

Locus tag: Franean1_6729
Symbol:
ID: 5675042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8183023
End bp: 8185515
Gene Length: 2493 bp
Protein Length: 830 aa
Translation table: 11
GC content: 74%
IMG OID: 641245578
Product: Type IV secretory pathway VirD4 protein-like protein
Protein accession: YP_001510969
Protein GI: 158318461
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
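
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon gives the number of amino acids.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(8183023, 8185515)
print(gene_len, protein_len)  # 2493 830
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of this record.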


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.577691
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.546954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTTG CTGTTTCGGC CCCTCCGCCG GGGCTTCCCC CCGTCCCTCC TCCACCCCCT 
CCACCGGCTC CGGAGCTGCC GGTCTGGCTC ACCGACCCCA GCCGGATCGT CTCTGGCTTG
GAGTCGTGGC TGGCCGCCCA CGCCGACTGG TGGCCGGTCG CTGTCCTCGA GCTCGTTCTC
CTGCTGGCCG CGGGGTACGC CCGCCGCCGG GTTCGCGCCC ATCGGCATGT GGTGCTGTGT
GAGGGGGCGC GGACGGTGGA GATCCTCACC CCGCCCGAGG TATCCGCCCA CGCGGCGGAG
ATCTTCTGGG GTCAGATGGG CGGCCTGCAA CGGGCGCGCT GGGACCGGCT CCTGCATGGC
CAGCCGCATC TGGGCTGGGA ACTGCTCGCC ACCCGCGCGG GGACGGTGAT CCGGCTGTGG
GTCCCCGGCC CGGTGCCGCC GGGCATGGTC GAGCGGGCGG TGCAGGCCGC GTGGCCCGGT
GCCCGCACTA CGACCCGGCC TGCCGCCGCG CCGCTGCCGG ACTACGCGCT GGCGATCGGC
GGGCAGCTAC GTCTGGCCCA GGTCGACGTG CTGCCGCTAC GCGCGGACCC GTCCGGGGAC
CCGATCCGGT CCCTGCTGTC TGCCGCCTCC GAACTCGACG ACAACGAGGC GGCGGTGGTG
CAGCTGCTGG CCCGGCCGGT CACGGGCCGC CGGCTCACGC TCGCCCGCCG CGCCGCTGCC
CGCCAGCGGG GCCAGTACGC CCCGACCCTG CTCAGCCGCG CCCTCGACCT GATCACCCCC
CACGCCGGGC CCCGCCCGGC CGCCGGGCAG GTGGTGGGCA AGACGGGGCC GACACCGCCG
GAGGACTCCG CGGCGTCCCG CGCAATCGGC CTCAAAGCAG TCGGCCCCCG CTGGGAAGCC
ACGGTCACCT ACGCCGCAGC TCACCTCGCC CCGCCGATGA CCAAGACCGG GCAGCAGGCC
GCCACGACGA TGCTGCGCGG CCGGGCACAC GCCCTCGCCT CCGCGTTCGC GTTGTACGCC
GGGCACAACT ATCTGCGCCG CCTGAAACTG CCCCACCCCA TCCCCGTGCT GGCGGGCCGC
CGGCTGCGGC GCGGGGACCT GCTCGCGGTC GCCGAACTCG CCGCCCTCGC CCACCTGCCC
CTGGACACCG CTGTCCCGGG GCTGTCCCGG GCCGGGGCCG CGGCCGTCGC GCCCCCGGCC
GGGATCCCCG AAGCCGGACC CCGGGTCAAG CCGCTAGGGG AGAGCGAAGC GGGCCGGCGC
CGCCGGGTCG GGTTGAACGT CGCCGACGCC CGCCATCACG TGCACGTCGT CGGCGCGACC
GGGTCGGGGA AATCGACGCT GCTCGCGAAC ATGATCCTCG CCGATGCGGA AGCCGGCCGG
GGGCTGGCGG TCTTCGACCC GAAAGGCGAC CTGGTCAACG ACGTCCTCGC CCGCCTCCCC
GCGGACGCCG CTGACCGGGT CGTGCTCCTC GACCCCGAGG ACGCTGCCGC CCCACCCTGC
TTGAACATCC TCGACGGCGG CGACGCCGAT CTGGCCACCG ACCAGCTGGT CGGGATCTTC
CGGCGGATCT GGGCCGACTC CTGGGGGCCG CGGACCGATG ACCTGCTGCG CGCCACCTGC
CTGACCCTGC TGGAACGGCG CACCCGCACC GGGATCACCC CGACGTTGGG CGACGTCGTC
AAGGTCCTCA CCGAACCGGA CACACGACGT AAAGCCACCA CCGGAGTCAC CGACCCGATC
CTCGCCAGCT TCTGGGAGTG GTACGGGCAG CTGTCTGACG GTGCCCGCGC GGCGGCGATC
GGCCCGATCC TGAACAAACT CCGCGCCCTG CTGCTCCGCA CCTTCGCCCG CCAAGCCCTG
GCCGCCGGCC CGTCCACCGT CGACCTACGC GAGGTCCTCG ACCGCGGCGG GATCCTCCTC
GTACGCATCC CCAAAGGCGT GATCGGGGAG GACGCCTCCC GGATCGTCGG GTCGATCGTG
TTGGCGAAGA TCTGGCAGAC CGTCCTACAC CGGGCCCGCC TGCACCCCGA CCAGCGCCCC
GACGCCACCT GTTTTTTGGA CGAAGCCCAA AACTTCCTCA CGCTGCCGGG GGCGGTGGAG
GACATGCTCG CCGAGGCCCG CGGCTACCGG CTGTCCATGA CCCTGGCCCA CCAGCACCTG
CGTCAACTCC CCGACGATTT GGCCGACGCC CTGTCCACCA ACGCGCGCAG CAAACTGTTC
TTCGGCGTCA GCCCGAAAGA CGCCGCGGAC CTGTCCCGGC ATGTCAGCCC CGTCCTGACC
CAGCATGACC TCGCCCGGCT CCCCGCGTGG ACGGCCGCAG CCCGGCTGGT GGTCGACCAG
GCCGACACCG CGGCGTTCAC CCTGCGGACC CGGCCTCTGC CACCACCTGT CGCTGGCCGC
GCTGATGCGC TGCGGGTTGC GGCACGCCGG CACACCGGCG CACCCGATGC CGGGCGCGGC
CCCCGGTCCG GTCAGGGCGG GGGTGGCCAG TGA
 
Protein sequence
MIVAVSAPPP GLPPVPPPPP PPAPELPVWL TDPSRIVSGL ESWLAAHADW WPVAVLELVL 
LLAAGYARRR VRAHRHVVLC EGARTVEILT PPEVSAHAAE IFWGQMGGLQ RARWDRLLHG
QPHLGWELLA TRAGTVIRLW VPGPVPPGMV ERAVQAAWPG ARTTTRPAAA PLPDYALAIG
GQLRLAQVDV LPLRADPSGD PIRSLLSAAS ELDDNEAAVV QLLARPVTGR RLTLARRAAA
RQRGQYAPTL LSRALDLITP HAGPRPAAGQ VVGKTGPTPP EDSAASRAIG LKAVGPRWEA
TVTYAAAHLA PPMTKTGQQA ATTMLRGRAH ALASAFALYA GHNYLRRLKL PHPIPVLAGR
RLRRGDLLAV AELAALAHLP LDTAVPGLSR AGAAAVAPPA GIPEAGPRVK PLGESEAGRR
RRVGLNVADA RHHVHVVGAT GSGKSTLLAN MILADAEAGR GLAVFDPKGD LVNDVLARLP
ADAADRVVLL DPEDAAAPPC LNILDGGDAD LATDQLVGIF RRIWADSWGP RTDDLLRATC
LTLLERRTRT GITPTLGDVV KVLTEPDTRR KATTGVTDPI LASFWEWYGQ LSDGARAAAI
GPILNKLRAL LLRTFARQAL AAGPSTVDLR EVLDRGGILL VRIPKGVIGE DASRIVGSIV
LAKIWQTVLH RARLHPDQRP DATCFLDEAQ NFLTLPGAVE DMLAEARGYR LSMTLAHQHL
RQLPDDLADA LSTNARSKLF FGVSPKDAAD LSRHVSPVLT QHDLARLPAW TAAARLVVDQ
ADTAAFTLRT RPLPPPVAGR ADALRVAARR HTGAPDAGRG PRSGQGGGGQ
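
The sequences above are displayed in space-separated groups of ten residues across wrapped lines, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a simple GC-fraction calculation for the gene sequence; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a display-formatted sequence (groups of 10, wrapped lines) into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGATCGTTG CTGTTTCGGC CCCTCCGCCG GGGCTTCCCC CCGTCCCTCC TCCACCCCCT"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Applying `clean_sequence` to the whole gene sequence block and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.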