Gene Francci3_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4026 
Symbol 
ID3906987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4812352 
End bp4814061 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content66% 
IMG OID637881355 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_483105 
Protein GI86742705 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCGA CTGCCGCGAC CGAGACCTAC CGCACTGCCC GCGATCTTCT GATCAACTTG 
CGCACGGACT ACGGCAAGGC GCTGGAGGAG TTCCGCTGGC CACGGTTCGA AGGCCAGTTC
AACTGGGCCA TCGACTGGTT TGACCCAATC GCCCGGAACA ACGACCGGGT GGCGCTGTGG
ATCGTGGAAG AGGATGGTTC GGAACGTCGG TGCACCTATG ACGAGATGGC CCGTCGCTCG
GACCGGGTAG CGACCTGGCT TGCCGGGTTA GGCATCGGCA AGGGCGATCC GGTGATCCTC
ATGCTCGGCA ACCAGGTGGA GCTCTGGGAG TCGATGCTGG CGCTCATGAA GCTGGGCGCG
GTAATCATAC CCACCACTAC CGCGATCGGT CCGACGGACC TCGCCGACCG GATCGAACGC
GGCGGCGCCA CCTGCGTGAT CGCCAATGCC GCCGACGCGG TGAAGGTCAA GCTGAAAAAC
CTGAACGGTG TCATTGTAGG TGGCGAGGCC GCCGGCTGGC GCCCCTACAC CGAGGCCAAC
GGCGTCACCG AGGTGCACCG GTTCGAATCC CGTACCGCCC CCACCGATCC GCTGCTGTTC
TACTTCACCT CCGGAACCAC CAGCCGGCCC AAACTCGTCG AGCACAGCCA GGTGTCCTAT
CCGGTCGGGC ACCTGTCCAC CCTGTACTGG ACCGGGGTGC AACCCGGTGA CGTGCATCTC
AACATCAGTT CCCCTGGCTG GGCAAAGCAC GCGTGGAGCT CGTTTTTCGT GCCGTGGATC
GCCGAAGCAA CGATCTTCGT CTACAACTAC GGCACGTTCG ACCCCGCCAA GCTGCTGGCG
CAGCTCCGCC GGGCTGGCGT AACCACGATG TGCGCGCCGC CGACCGTGTG GCGCATGCTC
ATCAAGGTCG ATCTTTCCGG CGGGCCCGGC GCGCTGCGCG AGGTGCTGTC GGCCGGTGAA
CCCCTCAATC CGGAGGTCAT CGACCAGGTC CGCGCCCACT GGGGCCTGAC CCTGCGCGAC
GGGTTCGGCC AAACCGAGAC GACCGCCCAG GTCGGCAACT CACCTGGCGC CGCCGTCAAG
CCGGGGTCGA TGGGCCGTCC GCTGCCGGGC GTGCCCACGG TCCTGGTCGA CCCGGTGAGC
GGCCAGCGCT CCAGCACTGA GGGAGAGCTG TGCCTCGATC TGGCCGCGCA CCCGCTCGCG
CTAATGACCA GTTACCGTGG CGACCCCGAA CGCAACGCGG AAGTACTGGC CGGCGGCTAC
TACCACACCG GTGACGTCGC ATCCCTCGAC GAGGACGGCT ACCTCACCTA CATCGGCCGT
ACCGACGACG TGTTCAAGGC CTCGGACTAC AAGGTGTCCC CCTTCGAACT GGAAAGCGTC
CTCGTCGAAC ACCCGGCTGT GCTCGAGGCA GCCGTCGTGC CCGCACCCGA CGAGGTACGC
CTGGCCGTGC CGAAGGCCTA CATCGCACTT GCTCCGGGAT GGGAGCCGAA CCGCGAGACA
GCGGAGGCGA TCCTCCGTCA CGCCCGCGAA AACCTTGCCC CCTACCTGCG CGTCCGGCGG
CTGGAGTTCT ACGACCTACC CAAAACGATC TCCGGCAAGA TCCGGCGCGT CGAGCTGCGC
AGCAGGGAGA GCGAAGCCGC GGGCACCCGC CTCAGCATGG AGTACCGCGA CACCGACTTT
CCCAGGCTGA CCAGCCGTAA GGCCGACTGA
 
Protein sequence
MTSTAATETY RTARDLLINL RTDYGKALEE FRWPRFEGQF NWAIDWFDPI ARNNDRVALW 
IVEEDGSERR CTYDEMARRS DRVATWLAGL GIGKGDPVIL MLGNQVELWE SMLALMKLGA
VIIPTTTAIG PTDLADRIER GGATCVIANA ADAVKVKLKN LNGVIVGGEA AGWRPYTEAN
GVTEVHRFES RTAPTDPLLF YFTSGTTSRP KLVEHSQVSY PVGHLSTLYW TGVQPGDVHL
NISSPGWAKH AWSSFFVPWI AEATIFVYNY GTFDPAKLLA QLRRAGVTTM CAPPTVWRML
IKVDLSGGPG ALREVLSAGE PLNPEVIDQV RAHWGLTLRD GFGQTETTAQ VGNSPGAAVK
PGSMGRPLPG VPTVLVDPVS GQRSSTEGEL CLDLAAHPLA LMTSYRGDPE RNAEVLAGGY
YHTGDVASLD EDGYLTYIGR TDDVFKASDY KVSPFELESV LVEHPAVLEA AVVPAPDEVR
LAVPKAYIAL APGWEPNRET AEAILRHARE NLAPYLRVRR LEFYDLPKTI SGKIRRVELR
SRESEAAGTR LSMEYRDTDF PRLTSRKAD