Gene Information |
Locus tag | Franean1_6729 |
Symbol | |
ID | 5675042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8183023 |
End bp | 8185515 |
Gene Length | 2493 bp |
Protein Length | 830 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245578 |
Product | Type IV secretory pathway VirD4 protein-like protein |
Protein accession | YP_001510969 |
Protein GI | 158318461 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.577691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.546954 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTTG CTGTTTCGGC CCCTCCGCCG GGGCTTCCCC CCGTCCCTCC TCCACCCCCT CCACCGGCTC CGGAGCTGCC GGTCTGGCTC ACCGACCCCA GCCGGATCGT CTCTGGCTTG GAGTCGTGGC TGGCCGCCCA CGCCGACTGG TGGCCGGTCG CTGTCCTCGA GCTCGTTCTC CTGCTGGCCG CGGGGTACGC CCGCCGCCGG GTTCGCGCCC ATCGGCATGT GGTGCTGTGT GAGGGGGCGC GGACGGTGGA GATCCTCACC CCGCCCGAGG TATCCGCCCA CGCGGCGGAG ATCTTCTGGG GTCAGATGGG CGGCCTGCAA CGGGCGCGCT GGGACCGGCT CCTGCATGGC CAGCCGCATC TGGGCTGGGA ACTGCTCGCC ACCCGCGCGG GGACGGTGAT CCGGCTGTGG GTCCCCGGCC CGGTGCCGCC GGGCATGGTC GAGCGGGCGG TGCAGGCCGC GTGGCCCGGT GCCCGCACTA CGACCCGGCC TGCCGCCGCG CCGCTGCCGG ACTACGCGCT GGCGATCGGC GGGCAGCTAC GTCTGGCCCA GGTCGACGTG CTGCCGCTAC GCGCGGACCC GTCCGGGGAC CCGATCCGGT CCCTGCTGTC TGCCGCCTCC GAACTCGACG ACAACGAGGC GGCGGTGGTG CAGCTGCTGG CCCGGCCGGT CACGGGCCGC CGGCTCACGC TCGCCCGCCG CGCCGCTGCC CGCCAGCGGG GCCAGTACGC CCCGACCCTG CTCAGCCGCG CCCTCGACCT GATCACCCCC CACGCCGGGC CCCGCCCGGC CGCCGGGCAG GTGGTGGGCA AGACGGGGCC GACACCGCCG GAGGACTCCG CGGCGTCCCG CGCAATCGGC CTCAAAGCAG TCGGCCCCCG CTGGGAAGCC ACGGTCACCT ACGCCGCAGC TCACCTCGCC CCGCCGATGA CCAAGACCGG GCAGCAGGCC GCCACGACGA TGCTGCGCGG CCGGGCACAC GCCCTCGCCT CCGCGTTCGC GTTGTACGCC GGGCACAACT ATCTGCGCCG CCTGAAACTG CCCCACCCCA TCCCCGTGCT GGCGGGCCGC CGGCTGCGGC GCGGGGACCT GCTCGCGGTC GCCGAACTCG CCGCCCTCGC CCACCTGCCC CTGGACACCG CTGTCCCGGG GCTGTCCCGG GCCGGGGCCG CGGCCGTCGC GCCCCCGGCC GGGATCCCCG AAGCCGGACC CCGGGTCAAG CCGCTAGGGG AGAGCGAAGC GGGCCGGCGC CGCCGGGTCG GGTTGAACGT CGCCGACGCC CGCCATCACG TGCACGTCGT CGGCGCGACC GGGTCGGGGA AATCGACGCT GCTCGCGAAC ATGATCCTCG CCGATGCGGA AGCCGGCCGG GGGCTGGCGG TCTTCGACCC GAAAGGCGAC CTGGTCAACG ACGTCCTCGC CCGCCTCCCC GCGGACGCCG CTGACCGGGT CGTGCTCCTC GACCCCGAGG ACGCTGCCGC CCCACCCTGC TTGAACATCC TCGACGGCGG CGACGCCGAT CTGGCCACCG ACCAGCTGGT CGGGATCTTC CGGCGGATCT GGGCCGACTC CTGGGGGCCG CGGACCGATG ACCTGCTGCG CGCCACCTGC CTGACCCTGC TGGAACGGCG CACCCGCACC GGGATCACCC CGACGTTGGG CGACGTCGTC AAGGTCCTCA CCGAACCGGA CACACGACGT AAAGCCACCA CCGGAGTCAC CGACCCGATC CTCGCCAGCT TCTGGGAGTG GTACGGGCAG CTGTCTGACG GTGCCCGCGC GGCGGCGATC 
GGCCCGATCC TGAACAAACT CCGCGCCCTG CTGCTCCGCA CCTTCGCCCG CCAAGCCCTG GCCGCCGGCC CGTCCACCGT CGACCTACGC GAGGTCCTCG ACCGCGGCGG GATCCTCCTC GTACGCATCC CCAAAGGCGT GATCGGGGAG GACGCCTCCC GGATCGTCGG GTCGATCGTG TTGGCGAAGA TCTGGCAGAC CGTCCTACAC CGGGCCCGCC TGCACCCCGA CCAGCGCCCC GACGCCACCT GTTTTTTGGA CGAAGCCCAA AACTTCCTCA CGCTGCCGGG GGCGGTGGAG GACATGCTCG CCGAGGCCCG CGGCTACCGG CTGTCCATGA CCCTGGCCCA CCAGCACCTG CGTCAACTCC CCGACGATTT GGCCGACGCC CTGTCCACCA ACGCGCGCAG CAAACTGTTC TTCGGCGTCA GCCCGAAAGA CGCCGCGGAC CTGTCCCGGC ATGTCAGCCC CGTCCTGACC CAGCATGACC TCGCCCGGCT CCCCGCGTGG ACGGCCGCAG CCCGGCTGGT GGTCGACCAG GCCGACACCG CGGCGTTCAC CCTGCGGACC CGGCCTCTGC CACCACCTGT CGCTGGCCGC GCTGATGCGC TGCGGGTTGC GGCACGCCGG CACACCGGCG CACCCGATGC CGGGCGCGGC CCCCGGTCCG GTCAGGGCGG GGGTGGCCAG TGA
|
Protein sequence | MIVAVSAPPP GLPPVPPPPP PPAPELPVWL TDPSRIVSGL ESWLAAHADW WPVAVLELVL LLAAGYARRR VRAHRHVVLC EGARTVEILT PPEVSAHAAE IFWGQMGGLQ RARWDRLLHG QPHLGWELLA TRAGTVIRLW VPGPVPPGMV ERAVQAAWPG ARTTTRPAAA PLPDYALAIG GQLRLAQVDV LPLRADPSGD PIRSLLSAAS ELDDNEAAVV QLLARPVTGR RLTLARRAAA RQRGQYAPTL LSRALDLITP HAGPRPAAGQ VVGKTGPTPP EDSAASRAIG LKAVGPRWEA TVTYAAAHLA PPMTKTGQQA ATTMLRGRAH ALASAFALYA GHNYLRRLKL PHPIPVLAGR RLRRGDLLAV AELAALAHLP LDTAVPGLSR AGAAAVAPPA GIPEAGPRVK PLGESEAGRR RRVGLNVADA RHHVHVVGAT GSGKSTLLAN MILADAEAGR GLAVFDPKGD LVNDVLARLP ADAADRVVLL DPEDAAAPPC LNILDGGDAD LATDQLVGIF RRIWADSWGP RTDDLLRATC LTLLERRTRT GITPTLGDVV KVLTEPDTRR KATTGVTDPI LASFWEWYGQ LSDGARAAAI GPILNKLRAL LLRTFARQAL AAGPSTVDLR EVLDRGGILL VRIPKGVIGE DASRIVGSIV LAKIWQTVLH RARLHPDQRP DATCFLDEAQ NFLTLPGAVE DMLAEARGYR LSMTLAHQHL RQLPDDLADA LSTNARSKLF FGVSPKDAAD LSRHVSPVLT QHDLARLPAW TAAARLVVDQ ADTAAFTLRT RPLPPPVAGR ADALRVAARR HTGAPDAGRG PRSGQGGGGQ
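The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the source database: it assumes Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the record above.
START_BP = 8183023
END_BP = 8185515
GENE_LENGTH_BP = 2493
PROTEIN_LENGTH_AA = 830


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Both checks pass for the figures listed in this record.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

To compare against the GC content field, paste the full gene sequence (with or without the spaces between blocks) into `gc_fraction` and multiply the result by 100.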